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NON-SPECIFIC DNA AMPLIFICATION 

The present invention is a continuation-in-part of U.S. 
5 patent application for "DNA Amplification and Subtraction 
Techniques," Serial No. 208,512, filed June 17, 1988. 

Field of the Invention 

The present invention relates to a method of non-specific 
10 DNA amplification, and to applications of the method for cDNA 
selection . 
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Background of the Invention 

The ability to isolate and clone cDNA fragments which are 
20 homologous to contiguous genomic regions has important appli- 
cations in medicine. As one example, isolating cDNAs which 
are all homologous to a DNA section containing a known but as 
yet unlocalized disease-causing, gene would result in the 
isolation of that gene and flanking genes; this analysis would 
25 facilitate the process of localization and analysis of the 
disease-causing gene. . A second important application is the 
possibility of identifying new genes which are linked to known 
markers . 

As another example, homologies between two related virus - 
30 es may be utilized by binding to the genome of a first virus, 
cDNAs which are derived from a cell type infected with a 
second virus, then isolating those cDNA sequences which are 
homologous to the first viral genome. 
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Another hybridization method which has important applica- 
tions to medical research involves the selection of genomic or 
cDNA fragments which are either common or unique to two dif- 
ferent biological sources, e.g., cell types, or cell types at 
different stages of activation or development. 

Standard methods for obtaining • cDNAs on the basis of 
their hybridization to genomic DNA material are often limited 
in the amount of genomic DNA or cDNA material which is avail- 
able, particularly when it is desired to clone a large number 
of different-sequence fragments obtained from the hybridiza- 
tion step. This limitation can result in loss of a large 
number of fragments whose abundance in the hybridization 
elution mixture is very low. 

15 S umm ary of the Invention 

It is therefore one object of the invention to provide a 
method of cloning cDNA species which are homologous to a 
region of contiguous genomic DNA and are selected from a 
mixture of cDNA species. The method involves obtaining the 
20 region of interest of contiguous genomic DNA and preparing 
CDNA species which contain end-terminal priming sequences. 
Single-stranded cDNA species are isolated on the basis of 
their hybridization to the region of contiguous genomic DNA. 
These isolated single-stranded cDNA species are mixed with DNA 
25 polymerase, all four deoxyribonucleotide triphosphates, and 
primers homologous to the cDNA end-terminal priming sequences. 
This mixture is then reacted under conditions to produce 
sequence-independent amplification of the single-stranded cDNA 
species. The amplified cDNA species resulting from this 
reaction are cloned into a vector. 

The method of the present invention can also include a 
second cDNA molecule isolation step. The single-stranded cDNA 
molecules are isolated on the basis of their homology to a 
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first set of selected genomic fragments and are then ampli- 
fied. The amplified cDNA species are denatured and a new set 
of single-stranded cDNA species are isolated on the basis of 
their hybridization to a second set of selected genomic f rag- 
5 ments. For example, the cDNA species may be rehybridized 
against the first set of selected genomic fragments for en- 
richment , or, hybridized against a different set of genomic 
fragments to select a subset of the cDNA species. 

In the preparation of cDNA species to contain end-ter- 
10 minal priming sequences, the above method may further include 
the amplification of the cDNA species* by sequence-independent 
amplification . 

The region of contiguous DNA in the above method may 
contain a known gene or region of interest. DNA fragments 
15 containing the known gene or region of interest may be ob- 
tained as follows. A DNA section containing the region of 
interest is size fractionated, the fraction containing the 
known coding sequence is identified, and the DNA from the 
identified fraction is isolated. One method to identify the 
20 DNA fragments of interest is by primer-initiated amplification 
in the presence of primers which are homologous to sequences 
in the region of interest. This isolated DNA can then be 
digested with a restriction endonuclease to cut the region of 
interest into a plurality of smaller fragments. 
25 In one embodiment of the above method, the contiguous 

genomic DNA section is bound to a solid support. 

It will be appreciated that the contiguous genomic DNA 
section can be contained- in a number of cloning vectors, known 
to one of ordinary skill in the art, for cloning large seg- 
30 ments of DNA, for example, yeast artificial chromosomes. 

One embodiment of the above method utilizes a contiguous 
genomic DNA section from the bovine leukosis virus genome and 
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a cDNA library made from mRNA obtained from a cell line in- 
fected with a virus derived from cell line 10C9. 

Another application of the above method includes the 
means for identifying cDNAs which correspond to genes located 
5 m the same chromosomal region as a known gene or region of 
interest. The fragments containing . the genomic DNA region 
containing a known gene or region can be obtained by prepara- 
tive size fractionation of genomic DNA fragments containing 
the known gene. One method to identify the DNA fragments of 
10 interest is by adding primers homologous to the known gene, 
DNA polymerase, and all four deoxyribonucleotides, to the gel 
matrix and treating the gel matrix under conditions which 
promote amplification of the region of the known gene defined 
by the primers. Linkers, which are useful as primers for 
15 sequence-independent amplification, are then ligated to the 
ends. Redundant linkers on the ends of the molecules can be 
removed by digesting with the appropriate restriction enzyme. 
. The DNA fragments are mixed with DNA polymerase, all four 
deoxyribonucleotides, and primers homologous to the linkers 
20 present on the ends of the DNA fragments, and reacted under 
conditions to produce sequence-independent amplification of 
the DNA fragments. 

The above described method can be used for genomic DNA 
regions containing such known genes as growth factor genes and 

25 growth factor receptor genes including the following: inter- 
leukins (e.g., IL-5 and IL-3) , GM-CSF, and erythropoietin. 
With each of these genes an appropriate cDNA library is cho- 
sen, such as, a T-cell cDNA library when using IL-5, or a 
renal cDNA library when using erythropoietin. 

30 These and other objects and features of the invention 

will be more fully understood when the following detailed 
description of the invention is read in conjunction with the 
accompanying drawings. 



WO 91/05861 PCT/US90/05938 



Brief Description of the Drawings 

Figure 1 is a flow diagram of the duplex amplification 
method of the invention; 

Figures 2A-2C show the sequences of three exemplary 
5 linkers used in practicing the invention, after attachment to 
opposite ends of blunt-ended duplex fragments (solid lines); 

Figure 3 illustrates one method for biotinylating ampli- 
fied cDNA fragments, in practicing the invention; 

Figure 4 is a flow diagram of the method of the invention 
10 for isolating DNA fragments which are unique to one fragment 
source; 

Figure 5 illustrates the isolation method shown in Figure 
4, applied to isolation of an RNA-derived sequence which is 
unique to individuals infected with a given viral agent; 
15 Figure 6 illustrates the amplification method of Figure 

l, t as it is used for confirming the presence of ET-NANB viral 
agent in bile from an infected animal, and for assaying the 
presence of the viral agent in infected individuals; 

Figure 7 shows regional assignments of potential growth 
20 factor/receptor genes within a subregion of chromosome 5. The 
following designations are used: 

CSF-GM, granulocyte-macrophage colony stimulating 

factor; 

CD14, encodes an antigen, which is a surface marker 

25 of monocytes; 

IL-3, multi-colony stimulating factor; 

IL4, B-cell growth factor; 

IL5, T-cell replacing factor; 

PDGFR, platelet-derived growth factor receptor; 

30 ADRBR, beta-2 adrenergic receptor; 

FGFA, fibroblast growth factor; 

CSF1R, CSF1 receptor (the protooncogene c-fms) 

CSF1, macrophage colony stimulating factor; 
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Figure 8 shows the steps in the preparation of small 

amplified genomic fragments derived from a region of human 
chromosome 5 containing the human IL-5 gene; 

Figures 9A and 9B show isolated genomic DNA from the 

5 HHW105 hamster/human hybrid cell line digested with NotI 

fractionated on a contour-clamped homogeneous electric field 

(CHEF) gel ( 9 B) , and sample products amplified with il-5 
specific primers (9B) ; 

Figure 10 shows Southern blots of fragments amplified by 
10 IL-5-specific primers, which were fractionated on an agarose 
gel; 

Figure 11 is an agarose gel fractionation of the non- 
specific amplification products of genomic fragments derived 
from the chromosome 5 region surrounding the IL-5 coding 
15 region; 

Figure 12 is an agarose gel fractionation of T-cell 
cDNA's which hybridized with the genomic DNA fragments from 
Figure 11; 

Figure 13 shows the hybridization of T-cell cDNA' s 
isolated on the basis of homology to the IL-5 region of human 
chromosome 5 to a variety of probes, including probes specific 
for actin, IL-3 and IL-5 coding regions.; 

Figure 14 shows the hybridization of T-cell cDNAs iso- 
lated on the basis of homology to the IL-3 region of human 
chromosome 5 to a variety of probes including probes specific 
for IL-3, GM-CSF and actin coding regions, and to probes for 
total human genomic DNA; 

Figure 15 shows long-range restriction maps, adapted 
from VanLeeuwen et al (1989), of the human chromosome 5 re- 
gions containing IL-5 and IL-3; and 

Figure 16 shows how the method of the invention may be 
applied to identifying specific disease-related loci and their 
gene products. 
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Detailed Description of the Invention 

I . Amplifying DNA Fragments 
A. Sources of DNA Fragments 
5 „ Figure 1 illustrates the method of amplifying duplex DNA 

fragments according to the invention-. The duplex fragments 
are typically present in a fragment mixture, and typically a 
mixture of cDNA fragments produced from messenger RNA (mRNA) 
transcript species, although genomic DNA fragments or linea- 
10 rlzed vectors or vector fragments are also suitable. 

Methods for isolating mRNA species from cellular or 
body-fluid samples, such as serum or bile, are well known. 
One method involves formation of a vanadyl-RNA complex, ex- 
traction of protein with chloroform/phenol, and precipitation 
15 with cold ethanol. In a second method, the RNA is extracted 
from a guanidinium isothiocyanate mixture with phenol, fol- 
lowed by a chloroform: isoamyl alcohol extraction, and precipi- 
tation of RNA from the aqueous phase with cold ethanol. The 
reader is referred to Maniatis, pp. 188-198, and references 
20 cited therein for details. 

As a rule, eukaryotic mRNAs are characterized by a 3' 
poly A terminal sequence which allows isolation by affinity 
chromatography, using oligo-dT or poly U bound to a solid 
support. In addition, or alternatively, total isolated RNA 
25 can be further fractionated by density gradient centrifuga- 
tion, or agarose gel electrophoresis, to obtain a desired size 
fraction of RNA species. 

Production of duplex cDNAs from the isolated mRNA tran- 
scripts is by conventional oligo dT or random prim ing of fir st 
30 strand synthesis. The former method is advantageous where 
duplex cDNAs derived only from poly A RNAs are desired. In 
this method, the RNA transcripts are primed with oligo dT, to 
promote first-strand cDNA synthesis by reverse transcriptase 
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in the presence of all four deoxynucleotides . The 3' hairpin 
formed from the first-strand synthesis product is used to 
prime the synthesis of the second strand by E. coli DNA poly- 
merase I, according to well-known methods (Maniatis, pp 
213216) . 

The random priming method of cDNA duplex formation is 
preferred where (a) duplex formation is not to be limited to 
poly A species, or (b) full-length duplex cDNAs are not re- 
quired. The method for first-strand cDNA synthesis utilizes 
an arbitrary sequence primer which is commercially available. 

Following second-strand synthesis, the cDNA fragments 
are preferably blunt-ended, for example, by filling in sticky 
ends with the large fragment of E. coli DNA polymerase I 
(Klenow fragment) as described below in the Materials section. 

Intact chromosomal DNAs can be isolated from agarose- 
immobilized cell preparations in those cases where chromosomal 
length DNAs are required. Genomic DNA from a selected cell 
source can also be isolated by standard procedures, which 
typically include successive phenol and phenol/chloroform 
extractions with ethanol precipitation. Genomic DNA fragments, 
either in a fragment mixture or a purified preparation, are 
also suitable for amplification, according to the method of 
the invention. The duplex DNA is fragmented preferably by 
partial or complete digestion with one or more selected re- 
striction endonucleases, although mechanical shearing may be 
employed. The fragmented genomic pieces may be size frac- 
tionated, or further treated to remove repetitive DNA. Other 
sources of double-stranded DNA fragments can include extra- 
chromosomal material, e.g., mitochondrial DNA, double-stranded 
DNA viruses, or viruses which have as part of their life cycle 
a double-stranded intermediate, e.g., a retrovirus. 

Cellular sources of genomic DNA fragments or RNA tran- 
scripts used for producing cDNA fragments include cultured 
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cell lines, or isolated cells or cell types obtained from 
tissue (or whole organs or entire organisms) . Cell sources 
are of interest in a variety of subtraction techniques where 
it is desired to identify or isolate particular RNA tran- 
5 scripts or genomic material which are unique to one of two 
related cell sources. Body-fluid sources of DNA and/or mKNA 
transcripts are of interest primarily where the fluid is known 
or suspected to contain a viral agent or other microbe of 
interest. Example 4, for instance, describes cDNA fragment 

10 mixtures produced from RNA isolated from bile, taken before 
and after infection of cynomolgus monkeys with enterically 
transmitted non-A/non-B (ET-NANB) hepatitis virus. 

Linearized or fragmented plasmid DNA, or fragmented 
phage DNA is another source of DNA fragments which one might 

15 ..y/ish to amplify. The vector DNA is obtained from purified 
plasmid or phage DNA according to conventional techniques, and 
linearized and/or fragmented by digestion with selected re- 
striction endonuclease (s) . Example 2 describes DNA fragments 
obtained by Hae lll digestion of phiX174 phage and linearized 

20 piAN13 plasmid. 

B. Fragment Linkers 

According to an important feature of the invention, the 
duplex DNA fragments to .be amplified a re ligated at their 

25 opposite end to^_lanker, to provide a priming sequence for 
strand - duplication, i.e. an end-terminal primer. The linker 
is preferably a short duplex DNA fragment, typically about 
20-30 basepairs in length, having a defined base pair 
sequence. One end of the linker is designed for enzymatic 

30 ligation to opposite fragment ends. Where the fragments are 
blunt-ended cDNAs or genomic fragments, this linker end is 
also blunt, for ligation to the fragment ends in the presence 
of T4 DNA ligase, as described in Example 2. Where the DNA 
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fragment mixture is formed by endonuclease digestion of 
genomic DNA, and produces fragments with sticky ends, these 
ends can either be filled in to generate blunt ends or the 
linkers can be prepared with complementary sticky ends at the 
5 fragment-ligation end. 

Since the ligation reaction involving the linkers also 
produces linker-to-linker ligation, the linkers are preferably 
designed so that dimers can be selectively cleaved by restric- 
tion endonuclease digestion. This can be done by constructing 
10 the blunt end of the linker with a sequence representing 
one-half of a selected restriction-site sequence. The re- 
striction site formed by blunt-end dimerization is preferably 
a rare cutter site, so that digestion of the fragments with 
the associated endonuclease, after linker addition to the DNA 
15 fragments, does not cleave a significant number of fragments 
internally, and thereby produce fragments with a linker at one 
end only. 

Figure 2 shows DNA fragments (solid lines) with exem- 
plary linkers A, B, or C attached to opposite fragment ends. 
20 As seen, each of the three linkers has one-half of an Nrui 
site at its blunt end, whereby linker dimers formed by blunt 
end ligation can be cleaved by Nru i digestion. It should be 
noted that linker ligation to the DNA fragments is direction- 
al, and linker self-ligation is limited to dimer formation, 
due to the one staggered end of the linker. The linkers are 
joined to blunt -ended fragments as indicated above, and as 
illustrated at the top in Figure 1. It will be appreciated 
that analogous linkers with one sticky end for ligation to 
corresponding fragment sticky ends could be similarly em- 
ployed. However, these linkers have the disadvantage that 
recleavage for removal of linker dimers is not possible. 

Also as seen in Figure 2, the linkers preferably include 
one or more internal restriction sites at which the ..fragments 
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can be cleaved after amplification. The internal linker 
restriction site(s) can serve three purposes. First, where 
the amplified fragments are used in the DNA subtraction method 
"described in Section D-l below, the restriction site can be 
5 used to cleave away a major portion of the linker in one of 
the two fragment mixtures, to prevent hybridization between 
linker regions in strands from the two different mixtures, 
where the same linker is used in the amplification of both 
fragment mixtures. Secondly, the linker cutting site(s) 
10 allows the amplified fragments (or the hybridized fragments in 
the Section Dl subtraction method) , to be equipped with de- 
. sired sticky end sites for cloning into a cloning vector . 
^Thirdly, as will be seen in Section D-l, the linker restric- 
tion site may be used to create a single-strand end, for 
15 '.purposes of biotinylating the fragments. In particular, an 
internal EcoR I site produces an AATT overhang, which can be 
filled in with commercially available biotinylated dUTP and 
dATP nucleotides . 

Synthetic duplex oligonucleotide linkers having selected 
20 sequences, such as those shown for the three exemplary linkers 
in Figure 2, can be prepared using commercially available 
automated oligonucleotide synthesizers. Alternatively, custom 
designed synthetic oligonucleotides may be purchased, for 
example, from Synthetic Genetics (San Diego, CA) . 

25 

C. Fragment Amplification 

The method steps described above are shown at the top in 
Figure 1. These steps • include obtaining duplex fragments, 
blunt-ending and attaching a linker to opposite fragments ends 
30 to produce linker-carrying . fragments, such as illustrated in 
Figure 2, and treating with a selected endonuclease to cut 
linker multimers at their blunt-end junctions- Alternatively, 
the linker dimers can be removed by fractionating _the DNA by 
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gel filtration or agarose gel electrophoresis, and eluting all 
DNAs larger than linker dimers . 

The linker-fragments from above are amplified by the 
repeated fragment duplication according to the following 
steps. First, the fragments are mixed with a large molar 
excess of a single-strand oligonucleotide primer, typically a 
10 6 -10 9 molar excess. The primer sequence is homologous to the 
fragment end linker; that is, the primer sequence and length 
is such as to promote hybridization to the complementary-lin- 
ker regions of the two strands under moderately stringent 
reannealing conditions . Another , requirement is that the 
primer, when hybridized to the linker region of the denatured 
fragment strands, be capable of priming polymerase-catalyzed 
strand replication; that is, the internal end of the primer 
15 provides a free 3'-0H. In the case of the linkers shown in 
Figure 2, preferred primer sequences are: 



10 



d(5'-GGAATTCGCGGCCGCTCG-3') for linker A- 

d<5' -GGTTGCGGCCGCTCG-3' ) for linker B; and 

20 d (5' -GACGCGTGAATTCTCG-3") for linker C. 

Single-strand oligonucleotide primers having the desired 
sequence are prepared by conventional methods or can be ob- 
tained from commercial laboratories, as above. 

25 The denatured fragments and primer are cooled, typically 

between about 37-60°C, to allow primer hybridization with the 
fragment linker ends. The cooling period is kept quite short, 
typically less than about- 5 minutes, to minimize strand-strand 
hybridization which would be expected on long-term reanneal- 

30 xng. At this point, or at the primer-addition stage, the four 
deoxynucleotides and a DNA polymerase capable of catalyzing 
second-strand, primed replication are added, and the reaction 
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mixture is brought to a temperature suitable for enzymatic 
strand replication. 

In the method described in Example 2 below, the deoxynu- 
cleotides and DNA polymerase were added prior to fragment 
5 denaturation. After heat denaturing, the mixture was annealed 
at 50°C for two minutes, then brought to 72°C for 5-12 minutes 
for primed, second-strand replication. The DNA polymerase 
used is Thermus aguaticus DNA polymerase (Taq DNA polymerase) , 
which is relatively heat-stable at up to 95°C for brief peri- 
10 ods (Kaledin et al; Perkin Elmer Cetus) . 

It will be appreciated from the- above, and from Figure 
,1, that the single added primer hybridizes with the 3' -end 
linker region in each DNA strand, and thus a doubling of 
fragment number occurs. The above replication procedure, 
15 which involves fragment denaturation by heating, cooling to 
form a fragment strand-primer complex, and second strand 
replication of the complex in the presence of DNA polymerase, 
is repeated until a desired concentration of fragments is 
achieved. In the above example, which employs a heat-stable 
20 Taq polymerase, the three replication steps are carried out 
simply by heating the fragment mixture to a denaturing tempe- 
rature (above the Tm of the fragments) , cooling briefly to 
allow fragment/ primer complex formation, and incubating for a 
period sufficient for second-strand synthesis. 
25 Example 2 illustrates the application of the fragment 

amplification method to a linearized DNA plasmid (piAN13) and 
to a. fragment mixture produced by Haelll digestion of phage 
phiX174 fragments. 

30 D. Utility 

The amplification method is useful for DNA duplex frag- 
ment amplification where limited amounts of genomic or cDNA 
fragment material is available and/or as a simple method of 
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amplifying duplex DNA material where DNA sequences are un- 
known. For example, in the probe binding study reported in 
Example 4, the method was used to increase by several orders 
of magnitude, the number of cDNA fragments derived from bile 
infected with enterically-transmitted non-A/non-B hepatitis 
(ET-NANB) viral agent. The amplification allowed the rela- 
tively small amount of viral agent present in the bile to be 
easily identified as an amplified cDNA fraction. 

The method is useful for amplifying the amount of a 
fragment or fragments available for cloning, to enhance clon- 
ing efficiency. One advantage of the method for cloning is 
the ability to provide the amplified fragments with end clon- 
ing sites suitable for the cloning vector which has been 
selected. 

The method can be applied to purified, enriched, or 
subfractionated DNA fragments, for increasing . the amount of 
fragment material available for further processing, e.g., 
further enrichment. For example, in the fragment-isolation 
method described in below, the isolated unique sequence frag- 
ments can be further enriched by additional amplification and 
selection. As another example, a DNA subtraction obtained by 
gel electrophoresis can be selectively amplified. Here the 
fragment mixture, prior to fractionation, is equipped with an 
end linker, and the selected fragment band is further iso- 
25 lated, either in situ within the gel, or after selective 
elution, by addition of primer, polymerase, and nucleotides, 
with, repeated heating and cooling replication steps. The 
following specific applications are illustrative. 



15 



20 



30 



1 . Isolating Unique Sequences 

(a) Hybridizing Fragment Mixtures 

Figure 4 is a flow diagram of the method for isolating 
unique sequences from one of two fragment mixtures, . according 
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to the invention. The two fragment mixtures are referred to 
herein as positive- and negative-source mixtures, and are 
derived from positive and negative sources of nucleic acid 
species, respectively. The positive source may be any cell, 
5 cell group, tissue, organ, subcellular fractions, or other 
defined source of RNA transcripts or genomic fragments, in- 
cluding vector fragments, which contain a number of different- 
sequence species. The negative source is similar to the posi- 
tive source, but lacks one or more positive source species. 
10 By way of example, the positive and negative sources may 

be cell or tissue samples which are infected (positive source) 
and non-infected (negative source) with a microbial agent. 
Here the nucleic acid species associated with the infecting 
agent, either in the form of genomic material or RNA tran- 
15 script species, are unique to the infected (positive) source. 
Similarly, the positive and negative sources may be a body 
fluid, such as serum or bile, from infected and non-infected 
subjects, respectively. Where the present method is applied 
to the study of hereditary diseases, the positive source is 
20 typically a normal cell line or tissue capable of producing 
the full range of normal proteins associated with that cell, 
and the negative source is a genetically altered cell which is 
suspected of an altered or lost mRNA species. That is, the 
positive source will yield a normal mRNA species which is not 
25 present in the genetically altered, negative source. 

As another example, it is known that malignant transfor- 
mation in cells may be triggered by the activation of onco- 
genes, as evidenced by expression of new RNA transcripts 
associated with the malignant state. In applying the method 
30 of the invention to isolating and identifying such RNA spe- 
cies, the malignant cells or tissue serves as the positive 
source of unique (tumor-related) RNA species, and normal, 
non-transformed cells or tissue, as the negative _ source . 



WO 91/05861 



PCT/US90/05938 



18 



10 



15 



20 



25 



30 



Other examples of negative and positive sources of transcripts 
or DNA duplex fragments are contemplated. 

The positive and negative sources shown in Figure 4 
yield mixtures of mRNA species, and these are converted to 
corresponding duplex DNA fragments as discussed in Section A 
above. The duplex fragments in the -two mixtures are blunt- 
ended and separately ligated at opposite fragment ends to a 
linker, as described in Section B. The linker attached to the 
positive- source fragment is preferably different than that 
attached to the negative-source fragments. For example, in 
the method outlined in Example 2, the positive source fig- 
ments are ligated to linker A in Figure 2, and the negative- 
source material, with linker C. Attaching linkers with dif- 
ferent sequences to the two fragment mixtures avoids the 
problem of linker hybridization when the two mixtures are 
hybridized. Alternatively, the same linker may be attached to 
both fragment mixtures, if the linker contains a restriction 
site which allows substantially all of the linker to be re- 
moved by restriction endonuclease digestion from the nega- 
tive-source fragment mixture before hybridization. 

Each fragment mixture is then amplified by repeated 
replications, as described in Section 1C above. in a 
preferred embodiment, the amplified, negative-source fragment 
strands are biotinylated or otherwise equipped with a ligand 
which can be used to selectively bind and remove strands or 
duplex species containing the ligand, by affinity chromatog- 
raphy, a variety of methods for biotinylating duplex fraa- 
ments are available. For example, biotin-labeled deoxynucleo- 
tides, such as Bio-ll-UTP and Bio-7-dATP, can be included in 
the amplification reaction mixture during the final rounds of 
replication. Duplex fragments are readily photobiotinylated 
by a standard procedure, as outlined in the protocol supplied 
by the manufacturer, Clontech (Palo Alto, CA) . ln one pr e- 



WO 91/05861 PCT/US90/05938 



19 

ferred method, the amplified fragments are treated with an 
endonuclease, such as Eco RI, which produces a 5' -AATT-3' 
sticky end. The sticky end is filled in with Klenow fragment 
in the presence of Bio-ll-UTP and/or Bio-7-dATP, both of which 
5 are commercially available. 

The two fragment mixtures are • now combined and heat- 
denatured to single strand form. Preferably the negative- 
source fragments are present in substantial molar excess, 
typically about a tenfold excess, to insure that non-unique 
10 fragments in the positive-source mixture have a high probabi- 
lity of hybridizing with complementary strands from the nega- 
tive-source fragments . 

The positive- and negative-source fragments are hybri- 
dized under conditions in which hybridization between the 
15 linkers in the two different mixtures does not occur. This 
may be accomplished, as noted above, by using differentse- 
quence linkers in the two mixtures, by cleaving the linkers 
from the negative- source fragment mixture prior to hybridiza- 
tion, or by hybridizing the fragment strands under conditions 
20 which preclude linker hybridization. Alternatively, the 
hybridization mixture may contain a large molar excess of 
primer oligonucleotides effective to hybridize with the posi- 
tive-source linker end regions, thus preventing inter-strand 
linker hybridization. 
25 The hybridization reaction may be carried out according 

to classical hybridization techniques, such as detailed in 
Hames, or more preferably, by rapid hybridization techniques, 
such as the phenol emulsion reassociation technique (PERT) , as 
described by Kohne, or the guanidinium thiocyanate method, as 
30 described by Thompson. 

In the PERT hybridization method, the fragments are 
hybridized at a final DNA concentration of preferably between 
about 5-50 ug/ml in a phenol reaction mixture, and the reac- 
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tion is carried out at 60-70°C for up to 5-6 hours. Alterna- 
tively, the reaction can be carried out at room temperature in 
the presence of formamide. The rate of annealing can be fol- 
lowed by hyperchromic shift at 280 nm or by diluting an ali- 
quot of the reaction mixture with buffer, and quickly passing 
the material through a hydroxyapatite .(HAP) column (Kohne) . 

In the guanidine thiocyanate method, the fragments are 
hybridized at a final DNA concentration of preferably between 
about 5-50. ug/ml in a 4M guanidine thiocyanate solution con- 
taining 8 mM DTT, and 20 mM Na citrate, pH 5.8. The mixture 
is heated to 65°C for 5 minutes, to insure complete denatura- 
tion, then incubated at 2 6°C for periods up to 72 hours, until 
complete hybridization has occurred (Thompson) . The formation 
of DNA/DNA hybrids can be confirmed as above. 

(b) Isolatin g Unigue-Seguence Fragments 

Following hybridization, the reaction components are 
contacted with a solid support effective to preferentially 
bind non-unique fragments. In a preferred embodiment, where 
the negative-source strands are biotinylated, the reaction 
components are separated on an avidin or streptavidin column 
which is effective to bind all biotinylated strands and non- 
biotinylated (positive-source) strands which are hybridized 
therewith. since the biotinylated strahds are present in 
large molar excess, virtually all common-sequence strands from 
the positive-source will be bound to a complementary, biotiny- 
lated strand, and thus removed by the affinity chromatography. 

Methods for separating the hybridization mixture on a 
streptavidin column are detailed in Example 3. Briefly, the 
hybridization mixture is passed through a column packed' with 
streptavidin-agarose and washed extensively with elution 
buffer. The eluates from several washes are pooled and the 
nucleic acid is concentrated by ethanol precipitation, yield- 
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ing a fragment mixture which is enriched for sequences which 
are unique to the positive-source material. 

According to an important feature of the invention, the 
fragments isolated will contain the end linkers originally 
5 attached to the positive-source fragments, and thus can be 
further amplified, by the amplification method described in 
Section C. Where the positive-source linker is different than 
that on the negative-source fragments, additional amplifica- 
tion (employing the "positive-source" primer) will selectively 

10 amplify positive-source fragments, providing further enrich- 
ment of sequences which are unique to the positive source. As 
indicated in Figure 4, the isolated amplified material may be 
rehybridized with the negative source fragments (which may be 
additionally amplified if necessary) , and unique fragments 

15 again isolated by affinity chromatography, to enhance enrich- 
ment. 

The final isolated fragments can be treated with DNA 
polymerase to ensure complete strand replication in the duplex 
species, then treated with a selected linker-site endonucle- 
20 ase, to provide fragment ends suitable for cloning or the 
like. Alternatively, or in addition, the isolated fragments 
can be radiolabeled according to known procedures, for use as 
a DNA probe. 

25 (2) Isolation and Cloning of Viral Agent 

Figure 5 illustrates one application of the subtraction 
method, for isolation and cloning of a viral agent present as 
RNA transcript in an infected source, such as selected tissue 
or body fluid (e.g. bile) . Here RNA mixtures isolated from 

30 infected (positive) and non-infected (negative) sources are 
used to produce cDNA duplex fragment mixtures, which are 
blunt-ended, ligated to a selected linker, and amplified as 
above. After biotinylating the negative-source fragments, the 
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two fragment mixtures are combined, denatured, reannealed, and 
separated by affinity chromatography (Section D-l) . The 
isolated fragments, which contain a selected restriction site 
in their end-linker regions, are cut at this site and cloned 
xn a suitable expression vector, such as the lambda gtn 
vector. The resulting lambda gtll phage constitute a fragment 
library enriched for sequences unique to the positive source 
material . 

To select for desired isolates which produce a viral 
antigen, host cells are infected with library phage, and 
reacted with antiserum from an infected individual, to bind 
virus-related antibodies to host cells producing viral anti- 
gens. The cells are then washed to remove unbound antibody 
and contacted with a reporter-labeled anti-human IgG antibody, 
to label cells which have bound virus-specific antibodies 
These cells are identified by the presence of the reporter 
The vector (s) containing a viral insert can be used for con- 
tinued antigen production, or as a source viral sequence 
probe. 

Alternatively, the cloned isolated fragments in the 
fragment library can be identified and selected using DNA 
probes. By way of illustration, the fragment library genera- 
ted as above can be replica plated and hybridized with labeled 
cDNA probes from infected, and non-infected sources. Probe 
hybridization can be carried out by conventional Southern 
blotting. Those library vectors which contain a viral-agent 
insert will hybridize with viral-agent cDNA probes present in 
the positive-source cDNA probes, but not with negative-source 
probes . 
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3 . Identification and Isolation of Cellular Nucleic Acid 
Products 

Methods similar to the DNA subtraction and probe selec- 
tion method just described can be applied to identifying and 
5 Isolating nucleic acid fragments which are unique to cells 
expressing oncogenes, or cells deficient in transcripts re- 
lated to genetic disease. 

Figure 6 illustrates how the amplification technique 
described in Section C can be used to confirm that the unique- 
10 sequence fragment clone (s) identified as above are in fact 
unique to the positive source. Here the two cDNA fragment 
fixtures prepared from positive and negative sources are 
Amplified, as in Section C, and fractionated by gel electro- 
phoresis. The size fractionated gel fragments are then trans- 
15 ierred by conventional Southern blotting to filters and hybri- 
dized with the isolated cloned insert of interest, after probe 
labeling. If the probe sequence is unique to the positive 
source fragments, hybridization will be seen with amplified 
positive-source fragments only; as indicated in Figure 6, the 
20 region of probe binding is fairly broad, as would be expected 
since the amplified fragments will have a range of sizes. 
This method of screening amplified cDNA fragments is detailed 
in Example 4, where the method has been used to confirm that 
an isolated ET-NANB fragment insert was unique to cDNA from an 
25 infected bile source. 

4 . Identifying cDNAs representing linked genes 

There is no general rule for the chromosomal distribu- 
tion of genes of like structure or function. However, gene 
30 duplication followed by divergence is a common mechanism for 
increasing the complexity and functional specialization of 
gene families. Although gene duplication could occur by 
transposition or retroposition, observations in a large number 
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of gene families suggest that duplications arose by tandemiza- 
tion of genes, perhaps as a result of unequal crossing over 
Tandemization would create closely linked clusters of genes 
which could then become separated by events such as chromosome 
breakage. However, even in cases where related gene are 
located on more than one chromosome, • as in the genes, of the 
complement system (Leonard et al.) or the genes for serum 
Apoproteins (Aegerter-Shaw) , subsets of the gene system may 
occur as closely linked genes. 

There are a large number of cases in which related genes 
are linked in clusters containing few unrelated intervening 
genes. Examples include genes for the related peptides of 
certain pituitary and placental hormones (D'Eustachio et al ) 
cluster of inunune response and class I major histocompatibi- 
lity genes (Barsh et al . ) , clusters of genes for some inter- 
ferons (Weiss et al.), genes for Ul and U2 RNAs (Erickson et 
al.), human metallothionein genes (Westin et al . ) , histone 
genes (Karin et al.), proline-rich salivary protein genes (Old 
et al.), genes for the constant regions of heavy or light 
chains of immunoglobins (Azen et al . ) , and alpha globin gene 
clusters (Tonegawa) . The two globin gene clusters are espe- 
cially relevant, because there are .linked embryonic genes 
within these clusters which have important functions in early 
development (the zeta gene within the alpha cluster and the 
epsilon gene within the beta cluster) . These would not have 
been readily detected by cross hybridization with the adult 
gene and therefore serve to illustrate the point that cross 
hybridization is not necessarily a good test of relatedness 
between clustered genes. Also relevant to this point are the 
problems encountered in attempting to isolate related lympho- 
kmes using cross-species homologies. Thus, the mouse IL-3 and 
GM-CSF genes share only limited homologies with their human 
counterparts and point to the possibility that growth factor 
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genes may have diverged very rapidly after tandemization (Wong 
et al., Yank et al . ) . 

In general, genes which have arisen by duplication of a 
common ancestor prior to or in the early phases of the mam- 
5 /malian radiation have diverged to the extent that they cannot 
be readily detected by cross-hybridization of nucleic acid 
.sequences. These observations indicate that, where divergent 
genes exist that arose by duplications of a common ancestor, 
moderate- to high- stringency hybridization approaches are 
10 probably not an adequate tool for their detection, whereas 
coding sequence comparisons between cDNAs may very well reveal 
; w functional similarities . 

The cDNA amplification method described herein makes 
I; possible a method to isolate cDNAs that map to specific large 
15 /VDNA fragments. Isolation of these cDNAs then allows limited 
a cDNA sequence analysis to determine whether they might encode 
interesting gene products and also provides probes to survey 
the normal tissue distribution of their corresponding tran- 
scripts. One application of this method has been directed to 
20 the isolation of new growth controlling genes. 

The isolation of new growth-controlling genes would have 
considerable significance in investigating the normal physio- 
logical function of growth factors and in the production of 
sufficient quantities of material for the assessment of poten- 
25 tial therapeutic applications. Figure 7 shows the regional 
assignments of potential growth factor/receptor genes within a 
subregion of chromosome 5. Five of these loci encode CSFs/in- 
terleukins. CSFs are a family of glycoproteins that are 
believed to be necessary for the growth and maturation of 
30 myeloid progenitor cells (Clark et al.-). GM-CSF has been 
localized to 5q23-31 (Huebner et al., LeBeau et al., 1986). 
IL-3 (multi-CSF) is located within 5q23-31 (LeBeau et al., 
1987). Initially, cDNA isolation efforts have been _ focused 
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predominately on the 5q23.3-q32 region of human chromosome 5 
which contains eosinophil colony stimulating factor, also 
called T-cell replacing factor, (interleukin 5: IL-5) and the 
B-cell growth factor (IL-4) (Le Beau et .al, VanLeeuwen et al) . 

A variety of sources can be used for the genomic DNA. A 
standard source is, of course, genomic DNA isolated from a 
tissue or a cell line. Hybrid cell lines also provide a con- 
venient source of genomic starting material. Large DNAs of 
greater than 50 kb can be separated by a number of techniques, 
including pulse field gel electrophoresis (Carle et al., Smith 
et al., Chu et al.), to provide an appropriate starting mate- 
rial which has been enriched for the target region. Another 
convenient source of genomic starting material can be genomic 
DNA banks created in a cloning vector. There exist a number 
15 of vectors, known to one of ordinary skill in the art, which 
are useful for cloning large pieces of DNA: such useful 
cloning vectors include modifications of phage lambda and 
yeast artificial chromosomes. Lambda vectors (eg., Proto- 
clone™ lambda gtlO, Promega) have been developed for creating 
genomic libraries which can tolerate relatively large inser- 
tions: for example, lambda gtlO can contain up to 7.6 kb 
inserts. Yeast artificial chromosomes (YAC) are in fact a 
preferred vector system since they provide the convenience of 
maintenance in a unicellular organism and have a capacity for 
very large inserts: up to several hundred kilobases (Burke et 
al.). in fact, the efficacy of the YAC system in the con- 
struction of a human genomic DNA bank has been demonstrated 
(Burke et al.) • 

A variety of methods to identify genomic DNA fragments 
30 of interest exists, such as hybridization with a labelled 
probe, but in many cases, are extremely limited by the availab- 
ility of sufficient quantities of target sequences for iden- 
tification and subsequent molecular manipulations. ...The ampli- 
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fication method described in Section C is of great value in 
this regard since it provides a method to amplify a group of 
DNA molecules independent of prior knowledge of their nucleic 
"acid sequence; an application of this method is presented in 
5 Example 5 . 

The overall scheme for purification of a genomic DNA 
fragment derived from the 5q23-31 region of human chromosome 5 
is shown in Figure 8 and described in detail in Example 5. 
The source of starting DNA was the hamster/human hybrid cell 
10 line HHW105 (Overhauser et al . ) ; this hybrid contains chromo- 
some 5 as its only human material. Accordingly, use of this 
hybrid cell line provided an immediate enrichment of human 
"chromosome 5 sequences. Initial pulse-field gel electrophore- 
sis (Burke et al . ) mapping using an IL-5 specific probe showed 
15 "a Not I IL-5-containing fragment of 500 kilobases (kb) which 
? was reduced by about 50 kb upon double-digestion with Nrul; 
this fragment was chosen as the genomic DNA target fragment. 

The HHW105 cell hybrid genomic DNA was cut with Not I; 
the resulting fragments were resolved on a CHEF gel (Smith et 
20 al., Chu et al . ) . Since CHEF gel tracks are straight rather 
than curved this gel system allowed the isolation of discrete 
size DNA fractions by horizontally slicing the gel. The IL-5 
containing fraction was identified (page 9) by taking a small 
sample of each gel slice and subjecting the sample to the 
25 polymerase chain reaction assay (Scharf et al.) using IL-5 

specific primers. 

Our modifications of the polymerase chain reaction 
conditions make it unnecessary to purify the DNA away from the 
agarose in the gel slice sample, unlike previous methods which 
30 employ a phenol/chloroform extraction step to eliminate the 
agarose. The ability to eliminate the phenol/ chloroform 
extraction step greatly facilitates performing the number of 
assays necessary to identify the genomic fragment of. interest. 
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An approximation of the degree of purification that this 

:i he°r ° f t gen ° miC ion affords can be made hy 

either estimating the amount of ethidium bromide fluorescent 
- the purged band vs. total, or by directly measuring the 
> amount of DNA nreseni- tk . . • . 9 tne 

Present m the size fraction by spectrof luoro- 
metry. Human chromosome S constitutes- about 5%, by length, of 
the human genome and is present at one copy per genome in the 

in th h T 05 ^ 2.5% of all DNA present 

in the hybrid genome (Wasmuth personal communication,. The 
following calculations can be made to assess the degree of 
Purification required to isolate one specific large fragment 
of human DNA . T he diploid genome size of human and hamsteT T. 

X 10 base pairs (bp) . Therefore, in HHW105 about 1 5 x Iff 
bp of DNA will be contained in chromosome S (2.5% of 6 X 10' 

Too\T S T" g " aVera5e — fra9ment le " 9th ° f -PP-ximately 
500 kb, chromosome 5 would contain 300 Nptl fragments; accord- 
ingly, about a 300-fold purification would be required to 
isolate only one of these fragments. An 80-fold purification 
is readily obtained on one pulsed-field gel (Michiel et al, 
and^two gel purification steps can raise this to at least 300- 

Nrul .I 1 :' 5 ' 00 ™*™ 1 * 9 ~ ^omic fragment was cut with 

firul and this sample resolved on a second CHEF gel. The gel 

to7oi i0ed " ab ° ve and 311 aliquot of each 9el «*i-«.d 

to polymerase chain reaction amplification with il-5 specific 
primers. These ampiified samples were then size fractionated 
on a conventional agarose gel and the DNA molecules trans- 
ferred to nitrocellulose. l„ this case the IL-5-containing 
fraction was identified by probing the nitrocellulose paper 
With a radiolabeled IL-5 specific probe (Figure 10) 

hamste 11 "; IL - 5 -°° ntainin 9 "otVNruI fragment and contaminating 
hamster fragments were further size fractionated, the A li„! 
*ers (see Section B, ligated to the molecules, and the DNA 
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fragments amplified by the method described in Section c. 
These amplified fragments were then size fractionated on a 
conventional agarose gel (Figure 11) and transferred to a 
nitrocellulose filter. A variety of solid supports are avail- 
able which will perform the same function as the nitrocellu- 
lose filter such as GeneScreen™ (DuPont-New England Nuclear) 
or the solid support described above in section 2A(ii) . 

Example 7 describes the selection of cDNA molecules 
which are "homologous to the genomic regions isolated in Ex- 
ample 5. The cDNA molecules were selected from a T-cell cDNA 
library (Example 6) . cDNA molecules can be amplified before 
•and/or after selection. For example, vector sequences flank- 
ing the 5' and 3' end of the cDNA inserts can be used as end- 
terminal primers to amplify the cDNA inserts before any selec- 
15 tion The 5' ends of the primers can also contain sequences 
encoding restriction sites which will subsequently simplify 
insertion of the cDNA molecules into cloning/expression vec- 
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Although a cDNA library was used in the IL-5 example 
(Example 7), the initial nucleic acid need not be a cDNA. It 
could, for example, be an RNA molecule where, after the mi 
tial selection by hybridization from a population of molec 
ules a first strand of homologous DNA is synthesized by a 
known procedure, such as random priming (Boehringer-Mannheim 
kit) or oligo-dT priming (Maniatis et al . ) , and second strand 
synthesis can be primed in the classic hairpin-primed fashion 

(Maniatis et al.) . t 

Extensive washing • of the filter, after hybridization, 
using relatively stringent conditions was performed to mini- 
mize non-specific binding.. Since different linkers were used 
for amplification of the selected cDNAs versus the genomic 
sequences (the C and A primers, respectively, described in 
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Section B ), any genomic sequences which come off the filter 
during the elution steps will not be amplified 

This selection technique is rapid and, to some extent 
serves to equalize the abundance of selected cDNAs . This 
equalization occurs because the copy number of the genomic 
exon sequences on the filter, should- be roughly equivalent, 
despite differing abundances of the cDNAs homologous to each 
to these genomic sequences. Thus, when the cDNAs are in 
excess, a normalization should occur relative to the genomic 
10 sequences. 

In the case of IL-5, Figure 12 shows the ethidium bro- 
mide staining of an agarose gel on which the amplified cDNA 
molecules, which were selected by the above described method, 
were resolved. Further, Figure 13 shows the hybridization of 
various probes with the amplified cDNAs . As can be seen from 
the figure, IL-5 specific sequences are present as expected. 
Further, a smear of hybridization corresponding to cDNA se- 
quences homologous to total human DNA is also present suggest - 
mg a variety of cDNA molecules other than IL-5 have been 
isolated by this method. 
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The method applied to the isolation of cDNAs homologous 
to the IL-5 containing region of chromosome 5 has also been 
applied to the IL-3 (Figure 7) containing region of chromosome 
5 As can be seen from Figure 14, the selection method was 
also efficacious for the isolation of cDNAs corresponding to 
the IL-3 region of chromosome 5. Using the genomic DNA clones 
we obtained from the IL-3 region we constructed a long-ranae 
restriction map of this region. Using a probe specific for GM- 
CSF we physically mapped GM-CSF to be within 10 kilobases of 
xL-3 (Figure 15). when cDNAs , selected on the basis of their 
hybr ldlzation to the IL _ 3 reg . on genom . c p ^ 

tested for their hybridization to a GM-CSF specific probe a 
positive signal was obtained. This result further demonstrates 
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the ability of the instant method to identify cDNAs present in 
the same chromosomal region as a selected gene of interest 

(such as IL-3) . 

One important consideration in the above selection 
5 technique is the quenching of genomic repetitive sequences 
within the various cloned DNAs . Quenching can be achieved by 
one or both of the following treatments: 1) pre-hybridization 
of the filter; and 2) pre-hybridization of the cDNA library in 
solution before the filter selection step. Both of these 
10 approaches tie up the repetitive sequences and prevent them 
from acting as a homologous hybridization site in the selec- 
tion step (e.g. the hybridization of the filter with the cDNA 
library) . Effective quenching can be achieved in the follow- 
ing ways : 

15 One approach to quenching is pre-hybridization to an 

intermediate C 0 t value (Litt et al . , Sealey et al.) with total 

randomly sheared human genomic DNA. 

Another approach to quenching genomic repeats is to use 

a chromosome-specific genomic DNA library from a chromosome 
20 other than the target chromosome for pre-hybridization: in the 

case of IL-5, a human chromosome other than human chromosome 

5. 

Finally, the most preferred approach to quenching geno- 
mic repeats is to use a library of isolated repetitive sequen- 

25 ces representative of various abundance classes of repeat 
sequences. This set of repeats is isolated by screening a 
human genomic DNA library with total radiolabeled human DNA at 
a low plaque density. A mixture of plaques which show low, 
intermediate, and high density signals are picked and these 

30 sets of sequences can be used as the blocking reagent for pre- 
hybridization. Alternatively, such a library can be generated 
by shearing total human genomic DNA into short lengths, dena- 
turing and reannealing to an intermediate C„t value, .and clon- 
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ing the repaired double-stranded molecules. In general an 
xnitial blocking of the filter with sonicated herring sperm 
DNA is followed by P re-hybridi Z ation blocking using the repeat 
library. 

To improve the representation of cDNAs corresponding to 
a given genomic region, a further step of hybridization selec- 
tion is performed employing, for example: (1) a different 
starting combination of restriction enzymes used to generate 
the partially purified genomic size fraction; and/or (2) a 
different source of the genomic DNA region of interest (eg a 
yeast artificial chromosome clone that contains the appropri- 
ate genomic region). The set of eluted cDNAs, from above, 
would be hybridized to these sequences. In this way the chan- 
ces of re-electing contaminating cDNAs would be minimized and 
the chances of isolating only cDNAs from a specific region 
would be maximized. 
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Once cDNA molecules homologous to a selected chromosomal 
regxon have been isolated, the cDNAs are cloned (Protoclone™ 
lambda gtlO) and plated at low plaque density. The phage DNAs 
are transferred to filters and hybridized (Woo) with a label- 
led repetitive sequence library or with total radiolabeled 
human DNA to distinguish between clones which contain repeat 
sequences from clones which contain unique sequences. DNA 
then isolated from a representative sample of the unique 
sequence clones (5 00 clones in the case of IL-5) and the cDNA 
inserts isolated by EcoRI digestion. Aliquots of the cDNA 
inserts are spotted to nitrocellulose filters. Then the cDNA 
inserts are nick-translated and used to probe the nitrocellu- 
lose filters to identify duplicate clones (Southern); dupli- 
cates are then discarded. 

The clones are then mapped to chromosome 5 using a mini- 
panel of the hybrid HHW105 DNA, hamster DNA, and human DNA A 
large collection of various digest of these DNAs as Southern 
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blots have been prepared on nylon filters. Clones that map to 
chromosome 5 and are unique can be positioned on the map of 
the genomic DNA surrounding the IL-5 gene utilizing hybridiza- 
tion back to the large DNA fragments that were originally 
5 resolved by CHEF gel electrophoresis. Clones which are found 
to contain repeats will be pre-guenched or subcloned prior to 

further analysis. 

This method provides a novel and efficient way to iden- 
tify cDNAs corresponding to a contiguous region of genomic DNA 
10 when conventional probes are unavailable, and also the method 
provides a means to identify previously unknown gene products 
of potential research. and therapeutic value. 

5 . I, identifying Homologous Viral Nucleic Acid Sequences 
15 t The methods of the instant invention are useful for the 

analysis of viral genome structure and function. Complete DNA 
copies of viral genomes can be used as probes or, alternately, 
selected fragments of the genome can be utilized. In the case 
of RNA viruses DNA copies of the genome can be synthesized, as 
20 by reverse transcriptase, or in some cases intermediate DNA 
forms of the virus may be isolated (Kashmiri et al . ) . Im- 
mobilized viral genomic DNA can be used to screen the nucleic 
acids of infected cells: for example, cDNA libraries may be 
made from the RNAs of infected cells and these cDNAs isolated 
25 on the basis of their sequence homology to the viral genomic 
DNA probe. The ability to amplify cDNA molecules independent 
of their specific sequence, that is, by use of the linkers 
(Figure 2) of the instant invention (e.g. Example 6), is 
helpful in the identification of normally low abundance mes- 
30 sages. Further, the ability to amplify the cDNAs eluted from 
the viral genomic DNA probe provides another increase in the 
capability of the method to identify cDNAs having infrequent 
hybridizations. — . - 
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The isolated cDNAs can then be mapped to- the viral 
genome to assign coding regions. This method can also be used 
to isolate cellular proteins which are related to the viral 

genomic sequences . 

The methods of the instant invention can also be applied 
to the analysis of the nucleic acid sequences of heterologous 
vxruses. For example, a related virus can be used to help 
characterized a new virus and identify related coding sequen- 
ces. This approach has been used to examine relationships 
between the bovine leukosis virus (BLV) and a newly isolated 
human-lymphoma associated virus (HuLAV, co-pending U.S. ap- 
plication 361,855, herein incorporated by reference). The 
structural proteins of HuLAV viruses demonstrate a high degree 
of homology to _ other known retroviruses; in particular, the 
seroreactivity of individuals seropositive for the 10C9 HuLAV 
(the HuLAV carried in cell line 10C9) with BLV envelope pro- 
tein suggests that the putative envelope protein of HuLAV 10C9 
may share immunogenic epitopes with BLV. 

A cDNA library prepared from HuLAV 10C9 particles <E- 
xample 9-B) was hybridized against a BLV genomic DNA probe 
bound to a nitrocellulose filter, the filter washed, the 
selected cDNAs eluted and cloned into lambda gtll. No cDNA 
molecules were selected which gave a positive signal when the 
lambaa gtll clones were immunologically screened with anti-BLV 
sera or anti-gp62 sera (gp62 is an envelope protein of another 
related retrovirus, HTLV-I; Kiyokawa et al) . 

However, HuLAV 10C9 cDNAs isolated by the methods of the 
xnstant invention, as described in Examples 9-10, yielded the 
following results: hundreds of lambda gtll cDNA clones posi- 
tive with anti-BLV; 15 clones positive with ant±- P 2* ( P 24 is 
an approximately 27,000 molecular weight HuLAV core protein) 
of which 4/15 cross-reacted with anti-BLV; a*d 4 positive 
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clones detected with anti-gp62 of which all four cross-reacted 
with anti-BLV. 

Accordingly, use of the instant methods applied to the 
isolation of homologous nucleic acid sequences between the 
related but distinct viruses BLV and HuLAV resulted in abi- 
lity to detect homologous nucleic acid sequences where stan- 
dard technology had failed. This increased ability was evi- 
denced by the immunological reactivities of the lambda gtll 
expression products of the cloned selected cDNAs . Further, 
when the selected cDNAs were cloned in lambda gtlO and then 
screened with full length BLV probes, 76 positives were detec- 
ted out of 2 X 10 4 clones screened. These results illustrate 
the increased sensitivity which can be achieved by employing 
/the methods of the instant invention. 



6. Enrichment of Selected Genomic DNA Fragments 

One difficulty in mammalian genetics is having adequate 
amounts of starting material available for analysis. The pre- 
sent amplification method provides a straight-forward method 
20 to generate a library which is highly enriched in sequences 
derived from a selected genomic region independent of dif- 
ferent sequence species. Genomic DNA fragments may be isolated 
from a contiguous genomic DNA section, eg. by size fractiona- 
tion on a gel, fragmented to form fragments of less. than about 
25 2 kilobases in size, the linkers of the instant invention 
ligated onto the fragments, and the fragments amplified. As 
illustrated above for the IL-5 region of chromosomes, genomic 
sequences can be isolated for a region of interest. These 
genomic sequences can then be amplified independently of the 
30 specific fragment sequence . using the linkers of the instant 
invention (Example 5-E) and then subcloned into an appropriate 
vector (eg. lambda gtlO, Protoclone™, Promega) . These cloned 
genomic sequences are then available for a variety_of manipu- 
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lations such as use as probes (Examples 7-8, 9-10), especially 
in genetic linkage analysis, and the generation of long-range 
restriction maps. Such long-range restriction maps are con- 
structed by restriction digestion of the various clones, size 
fractionation of their fragments, construction of restriction 
maps for each clone, and finally lining up overlapping regions 
between clones. Libraries generated in this way provide 
excellent representation of the genomic region of interest. 
1 ' Identifi cation of Disease-related Loci 

The molecular identification and analysis of genes 
corresponding to disease-related loci has been a major chal- 
lenge of molecular genetics. Genetic linkage data exists fo- 
a number of genetic diseases (e.g. familial, schizophrenia 
(Sherrington et al.) and in some cases physical maps of the 
specific chromosomal regions are available (e.g. the cystic 
fibrosis locus has been mapped to an approximately 500 kb 
region (Poustka et al . ) ) , growth factors involved in certain 
neoplasms such as the familial polyposis locus isolated to a 
small region of human chromosome 4 (Bodmer et al . ) . The 
method of the invention may be useful to aid in- the identifi- 
cation of specific disease-related gene products. Figure 16 
provides a general outline of approaches which may be taken 
leading to the identification of specific disease-related loci 
and their gene products. 

Using the techniques outlined above for the isolation of 
genomic DNA fragments, long range restriction maps can be 
constructed of the genomic DNA region corresponding to a 
disease-related locus. a number of potential starting mate- 
rials containing the genomic DNA of interest are available as 
discussed in section II-B, such as, yeast artificial chromo- 
somes, hybrid cell lines, and DNA fragments obtained by 
chromosome jumping (Collins et al . ) . Then, by application of 
the techniques used for the isolation of the genomic DNA in 
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the region of IL-5, known genetic markers for the regions of 
disease related loci can likewise be used to isolate genomic 
DNAs,, from these regions and then these selected genomic DNAs 
can be used as probes to select corresponding cDNA molecules 
5 from normal and disease-state samples. 

Candidate cDNAs from the region- of interest can then be 
isolated and compared by a variety of methods to determine 
differences between normal and disease state samples. One 
approach is the use of the subtraction method described above 

10 to detect cDNAs which are present in one sample, but not 
present in another. Another approach ■ is to directly select a 
small set of cDNAs using the technology described above. To 
identify specific mutations in the isolated cDNAs a number of 
systems can be used including the following: (1) the use of 

15 modified denaturing gradient polyacrylamide gels to identify 
single base pair alterations, and (2) the use of incorporated 
biotinylated nucleotides to mark the positions of mutations 
determined by comparison to wild type DNAs. Also, partial DNA 
sequence analysis can be used to facilitate matching and 

20 comparisons of candidate cDNAs obtained, from different sam- 
ples, as well as to eliminate candidates based on their homol- 
ogy to known sequences present in sequence data bases (e.g. 
N . I . H . GeneBank) . 

The following examples illustrate the method of fragment 

25 amplification and isolation, and of the cDNA isolations de- 
scribed above, but are in no way intended to limit the scope 
of the method or its application. 

Example 1 

30 General Procedures and Materials 

Plasmid piAN13 was obtained from Dr. Brian Seed (MIT) . 
E. coli strain KM392, a suppressor positive strain, .was ob- 
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tained from Dr. Kevin Moore, DNAX, Palo Alto, CA. E coli 
strain LG75, a suppressor minus strain, was obtained from Dr 

7 SeSd (MIT) " L ^ da Ch21a was obtained from 

Frederxck Blattner (University of Wisconsin) , and a NotI site 
was inserted between the E^oRi and Hindlll sites (Ch 2 I^j, , b y 
replacing the 2 kb EcpRi/ fragment ^ ^ * 

cleotide containing a Noti site. E. coli strain MC1061 har- 
boring the P3 plasmid was obtained from Dr. Brian Seed (MIT) ' 

Terminal transferase (calf thymus), alkaline phosphatase 
(calf xntestine), polynucleotide kinase, E . coli DNA poly- 
merase I (Klenow fragment), and SI nuclease ~weTe~obtained from 
Boehnnger Mannheim Biochemicals (Indianapolis, IN); P hiX174 
rragments, produced by digesting the phiX174 phage to comple- 
15 sZ M^)^ ~ ZI1 ' WSre ° btained fr ° m BRL laboratories (Bethe- 

imal, EcoRI, Not!, T4 DNA ligase and T4 DNA polymerase 
were obtained from New England Biolabs (Beverly, MA); and 
streptavidin agarose, from Bethesda Research Labs (Bethesda, 
Low-gelling temperature agarose (Sea Plaque) was ob- 
tained from FMC (Rockland, ME) . Nitrocellulose filters were 
obtained from Schleicher and Schuell . 

Synthetic oligonucleotide linkers and primers were 
prepared using commercially available automated oligonucleo- 
tide synthesizers. Alternatively, custom designed synthetic 
oligonucleotides may be purchased, for example, from Synthetic 
Genetics (San Diego, CA) . cDNA synthesis kit and random prim- 
ing ^labeling kits were obtained from Boehringer -Mannheim 
Biochemical (BMB, Indianapolis, IN) . 

Kinasing of single strands prior to annealing or for 
labeling is achieved using, an excess, e.g., approximately 10 
units of polynucleotide kinase to 1 nmole substrate in the 
Presence of 50 mM Tris, P H 7.6, 10 mM MgCl 2 , 5 mM dithioth- 
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reitol, 1-2 mM ATP, 1.7 pmoles gamma 32 P-ATP (2.9 mCi/mmole) , 
0.1 mM spermidine, 0.1 mM EDTA. 

Site-specific DNA cleavage is performed by treating with 
the suitable restriction enzyme (or enzymes) under conditions 
5 which are generally understood in the art, and the particulars 
of which are specified by the manufacturer of these commer- 
cially available restriction enzymes (see, e.g., New England 
Biolabs, Product Catalog) . In general, about 1 ^1 of plasmid 
or DNA sequence is cleaved by one unit of enzyme in about 2 0 
10 \il of buffer solution after 1 hr digestion at 37°C; in the 
examples herein, typically, an excess of restriction enzyme is 
used to insure complete digestion of the DNA substrate. 

Incubation times of about one hour to two hours at about 
37?C are workable, although variations can be easily tolera- 
15 ted. After each incubation, protein is removed by extraction 
with phenol/chloroform, and may be followed by. ether extrac- 
tion, and the nucleic acid recovered from aqueous fractions by 
precipitation with ethanol (70%) . If desired, size separation 
of the cleaved fragments may be performed by polyacrylamide 
20 gel or agarose gel electrophoresis using standard techniques. 
A general description of size separations is found in Methods 
in Enzvmology (1980) 65:499-560. 

Restriction cleaved fragments may be blunt-ended by 
treating with the large fragment of E. coli DNA polymerase I 
25 (Klenow reagent) in the presence of the four deoxynucleotide 
triphosphates (dNTPs) using incubation times of about 15 to 25 
min at 20°C to 25°C in 50 mM Tris pH 7.6, 50 mM NaCI, .6 mM 
MgCl 2 6 mM DTT and 0.1-1.0 mM dNTPs. The Klenow fragment 
fills in at 5' single-stranded overhangs in the presence of 
30 the four nucleotides. If . desired, selective repair can be 
performed by supplying only one of the, or selected, dNTPs 
within the limitations dictated by the nature of the overhang. 
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After treatment with Klenow reagent, the mixture is extracted 
with phenol/chloroform and ethanol precipitated. Treatment 
under appropriate conditions with SI nuclease results in 
hydrolysis of any single-stranded portions of DNA. in par- 
ticular, the nicking of 5' hairpins formed on synthesis of 
cDNA is achieved. 

Ligations are performed in 15-50 m volumes under, the 
following standard conditions and temperatures: for example, 
20 mM Tris-Cl pH 7.5, 10 mM MgCl 2 , 10 mM DTT, 33 mg/ml BSA, 10 
mM-50 mM NaCl, and either 40 mM ATP, 0.01-0.02 (Weiss) units 
T4 DNA ligase at 14°C (for "sticky end" ligation) or 1 mM ATP, 
0.3-0.6 (Weiss) units T4 DNA ligase at 14°C (for "blunt end" 
ligation) . Intermodular "sticky end" ligations are usually 
performed at 33-100 mg/ ml total DNA concentrations (5-100 nM 
15 total end concentration) . Intermolecular blunt end ligations 
are performed at 1 mM total ends concentration. 

In vector constructions employing "vector fragments", 
the vector fragment is commonly treated with bacterial alka- 
line phosphatase (BAP ) or calf intestinal alkaline phosphatase 
20 (CIP) in order to remove the 5'. phosphate and prevent self-li- 
gation of the vector. Digestions are conducted at P H 8 in 
approximately 10 mM TrisHCl, 1 mM EDTA using about 1 unit per 
mg of BAP at 60°C for one hour or 1 unit or CIP per mg of 
vector at 37°C for about one hour. in order to recover the 
25 nucleic acid fragments, the preparation is extracted with 
Phenol/ chloroform and ethanol precipitated. Alternatively, 
religation can be prevented in vectors which have been double 
digested by additional restriction enzyme digestion and separ- 
ation of the unwanted fragments. 
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Example 2 
Amplifying Duplex DNA Fragments 
A. Modification and Amplification of piANl3 plasmid 

Plasmid piAN13 carries a supF suppressor gene capable of 
5 suppressing an amber mutation. The plasmid was linearized by 
digestion with Sma l and ligated to the linker whose sequence 
is shown at A in Figure 2, at a 1:100 molar ratio in the 
presence of 0.3-0.6 Weiss units of T4 DNA ligase. The linker 
sticky ends were filled in with Klenow fragment, conventional- 
10 ly, and the mixture was treated with Nrul to cleave linker 
dimers. Sma l linkers were attached to the blunt-ends of the 
plasmid fragments under ligation conditions like those above. 
After-treatment with Sma l to remove redundant Sma l linkers, 
the fragments were recircularized. This was done at a frag- 
15 ment concentration of about 1 ng/ml, in the presence of 4,000 
units/'ml DNA ligase, at about 14°C for 18 hours. The result- 
ing plasmids were thus modified to contain an A-sequence 
linker on either side of the original Sma l site. 

E. coli strain MC10 61 harboring the P3 plasmid was 
20 transformed with the recircularized plasmid, and selection for 
successful plasmids was made on the basis of bacterial growth 
in the presence of kanamycin, ampicillin, and tetracycline. 
Plasmids were isolated from the resistant bacterial colonies, 
and cut with Sma l to produce blunt-end, linearized fragments 
25 having the desired A-sequence end linkers. 

To 100 of 10 mM Tris-Cl buffer, pH 8.3, containing 

1.5 mM MgCl 2 (Buffer A) was added 1 x 10-3 ^g of the linearized 
plasmid, 2 nM of a primer having the sequence 
d(5' -GGAATTCGCGGCCGCTCG-3' ) , 200 \M each of dATP, dCTP, dGTP, 
30 and dTTP, and 5 units of Thermus aguaticusr DNA polymerase (Taq 
polymerase) . The reaction mixture was heated to 94°C for 1 
minute for denaturation, allowed to. cool to 50°C for 2 min 
for primer annealing, and then heated to 72°C for 5-12 min to 



WO 91/05861 



PCT/US90/05938 



42 



10 



allow for primer extension by Taq polymerase. The replication 
reaction, involving successive heating, cooling, and polyme- 
rase reaction, was repeated an additional 25 times with the 
aid of a Perkin-Elmer Cetus DNA thermal cycler. 

B. Amplification of phiX174 Fragments 

To a 50 jil solution containing 100 ng of the Haelli 
blunt-end fragments was added 30 0 ng of the linker whose 
sequence is shown at C in Figure 2, with ligation of the 
linker in the presence of DNA ligase being carried out as 
above. The mixture was treated with Nrui to cleave linker 
dimers . 

To 10 0 nl of Buffer A was added 1 ng of the 
P hiX174/HaeIII fragments, 2 of a primer having the sequence 
15 d(5'-CACGCGTGAATTCTCG-3'), 200 jiM each of dATP , dCTP, dGTP, 
and dTTP and 5 units of Tag DNA polymerase. . The reaction 
mixture was heated to 94°C for 1 minute for denaturation, 
allowed to cool to 50°C for 2 min for primer annealing, and 
then heated to 72°C for 5-12 min to allow for primer extension 
20 by Taq polymerase. The replication reaction steps were repea- 
ted 25 times, as above. The amplified .fragments are referred 
to below as phiX/linker-C fragments. 

A second preparation of phiX174 fragments was amplified 
by similar methods, but using the seguence-A linker employed 
25 in the piAN13 amplification. This preparation is referred" to 
below as phiX/linker-A fragments. 
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' Example 3 
Selection for piAN13 Sequences 
A. Biotinylation of Amplified phiX174 Fragments 

The amplified phiX174/linker-C fragment produced in 
Example 2-B are treated with EcoRi to yield 5' protrudina ends 
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having an AATT sequence overhang. The fragments are biotiny- 
lated by Klenow fill-in reaction according to known procedures 
(.Maniatis, pp. 113). Briefly, the reaction mixture consists 
of 100 ^g/ml fragments in 100 mM potassium phosphate (pH 7.2), 
5 2 mM CoCl 2 , 0 . 2 mM DTT, 40 ]M Bio-7-dATP, 100 dTTP, and 100 
units/ml Klenow fragment. After incubation at 25°C for 45 
minutes, the reaction is terminated by heating at 65°C for 10 
minutes. 

10 B. Hybridization Reaction 

Amplified piAN13 vector is mixed at various mole ratios 
with the phiX/Linker-A fragments produced in Example 2-B. 
Mixtures contained weight ratios of 1:100, 1:30 and 1:3 piAN13 
to phiX/Linker-A fragments are prepared. Each mixture is then 

15 mixed with a 10-fold molar excess of biqtinylated 
phiX/Linker-C fragments prepared as above, in an annealing 
buffer containing 2M guanidinium isothiocyanate (pH 5.8). The 
mixtures are heated to 65°C for 5 minutes, then allowed to 
cool to 25°C and hybridized for a 12-hour period. 

20 

C. Separation by Affinity Chromatography 

A 1 ml siliconized syringe plugged with siliconized 
glass wool is packed with 0.5 ml streptavidin-agarose and 
washed with lOmM Tris-Cl, 0.5 M NaCl, pH 7.0, containing 1 mM 

25 EDTA (elution buffer) . A portion of each of the three hybri- 
dization mixtures from above is EtOH precipitated (70%) , 
resuspended in 10 mM Tris HC1, 1 mM EDTA pH 7.0, and then 
loaded onto the column which is washed with several volumes of 
column elution buffer. 

30 For each mixture, the eluted fractions from the strep- 

tavidin column are combined and the DNA concentrated by etha- . 
nol precipitation. The DNA is resuspended in Buffer A to a 
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concentration of 0.01 ng/ml, and mixed with 2 ^ Qf ^ 
primer having the sequence d <5' -GGAATTCCGGCCGCTCG-3 ' ) , and 200 
UM each of dATP, dCTP, dGTP, and dTTP, and 5 units Taq poly- 
merase. The reaction mixture is denatured at 94°C for 1 
5 minute, cooled to 50°C for primer annealing, and primer exten- 
sion performed at 72°C for" 5-12 minutes. 

The fragments from the each of the three eluted mixtures 
above are cut with Noti to produce Noti sticky ends in the 
linker regions of the fragments, as can be appreciated from 
10 Figure 2A. The fragments (13 ng/ml) are mixed with 60 >xg/ml 
Not I -digested phage Ch21aLJ, and inserted at the Noti site in 
the phage in the presence of T4 DNA ligase. After encapsula- 
tion, the phage are used to infect E . coll strain KM3«>2 
suppressor-positive strain, and E. coli strain LG75, a sup- 
15 pressor-minus strain. The number of plaques produced in the 
KM392 strain provides a measure of total number. of recombinant 
Phage, since this strain is able to suppress the phage amber 
mutation, and thus allow phage expression of its A and B 
genes. In the LG75 strain, by contrast, only phage which have 
incorporated the P iAN13 fragment- produce plaques, since the 
sup F gene is required for growth of phage in suppressor minus 
bacterial strains. The ratio of plaques on the two host 
strains thus indicates the ratio of P iAN13 fragments in the 
fragment mixtures. As a control, the ratio of fragments in 
the three mixtures before hybridization and streptavidin 
separation is also determined. 
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Example 4 

Screening Ampl ified cDNA Fragment Mixtures 
30 A. Bile-derived cDNAs 

Two cynomolgus monkeys were intravenously injected with 
a 10% suspension of stool positive for ET-NANB obtained from a 
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human volunteer. It was determined the stool was positive for 
ET-NANB by binding of viruslike particles (VLPs) in the stool 
to immune serum from a known ET-NANB patient. The monkeys 
leveloped elevated levels of alanine aminotransferase (ALT) 
5 between 24-36 days after inoculation, and excreted 27-30 nm 
VLPs in their stools in the pre-acute phase of infection. The 
animals became seropositive for VLPs in the inoculum. 

The bile duct of each infected animal was cannulated and 
about 1 cc of bile was collected. RNA was extracted from the 
10 bile by hot phenol extraction, using a standard RNA isolation 
procedure. Double-strand cDNA was formed from the isolated 
RNA by a random primer for first-strand generation, using a 
? cDNA synthesis kit obtained from Boehringer-Mannheim (Indiana- 
polis, iN) . 

15 f A cDNA fragment mixture from non-infected cynomolgus 

'monkey bile was prepared similarly. . 

B. Stool-Derived cDNAs 

cDNA fragments obtained from human stool samples were 

20 prepared as follows: a 10% suspension of stool samples from 
healthy or ET-NANB- infected individuals was layered over a 30% 
sucrose density gradient cushion, and centrifuged at 25 kg for 
6 hr in an SW27 rotor, at 15°C. The pelleted material con- 
tained 27-32 nm VLP particles characteristic of ET-NANB infec- 

25 tion in the infected-stool sample. RNA was isolated from the 
sucrose gradient pellets in both the infected and noninfected 
samples, and the isolated RNA was used to produce cDNA frag- 
ments as described in Example 2 . 

30 C. Amplification of the cDNA Fragment Mixtures 

The cDNA fragment mixtures from infected and non- in- 
fected bile source, and from infected and noninfected human- 
stool source were each amplified by linker/ primer replica- 
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tion, according to the method of the invention. Briefly, the 
fragments in each sample were blunt-ended with DNA Pol \ thel 
exacted with phenol/chloroform and precipitated with et ha- 

2 In! " " " C ° nditions ^ose described in Ex Mpls 

2- and the mxtures were digested with Nrul to remove linLr 
dimers Each mixture was brought to a fina! concentration of 
0.0! „g/ml ln Buffer A , and to th . s _ addfid 2 ^ of a J 

10 each or dATP, dCTP, dGTP, and dTTP . The reaction mixture „a^ 
heated to 94-C for 1 minute for denaturation and allowed to 

to 72 c for 5-12 min to allow for primer extension by Tao 
polymerase. The replication reaction, involving successive" 
15 heating, cooling, and polymerase reaction, was repeated 2 
agonal 25 times with the aid of a Perfcin E lmer Cetus D NA 

thermal cycler. 



O. Screening Amplified cDNA Mixtures 

20 Mr . fOUr amplifled <=°™ foment mixtures from above 

were fractionated by agarose gel electrophoresis, using a 2% 
agarose matrix. After transfer of the- DNA fragments from the 
agarose gels to nitrocellulose paper, the filters were hyb^! 

25 3 ? 1-33 Hb fragment associated with 

ET NANB viral agent. The derivation and sequence of this 
fragment, which is carried in a pTZ-KFl (ET1 . 1 ) piasmid identi- 

dlgest T N °- 67717 • The cloned fra9ment — >y 

digesting the pTZ-KFl (BT1 . a) plasniid with ^ J 

30 primint thl r ^ ^ ,r "*- nt ' (i "> ««- 

u Priming the isolated fragment. 

The probe hybridization with th e fractionated cDN* 
mixtures was performed by conventional Southern blotting 
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methods (Maniatis, pp. 382-389) . Figure 6 shows the hybridi- 
zation pattern obtained with cDNAs from infected and non- 
infected bile sources. As seen, the ET-NANB probe hybridized 
with amplified fragments obtained from the infected source, 
5 but was non-homologous to sequences obtained from the non- 
infected source. Similar results were- obtained with amplified 
fragments obtained from infected and non-infected human stool 
sources. This procedure demonstrates the utility of the 
described invention since hybridization analysis of an equiva- 
10 lent amount of cDNA prior to amplification would not have 
yielded detectable hybridization signal. 

Example 5 

^ Isolation of Genomic DNA from the region adjacent the gene 

15 encoding IL-5 

A. Preparation of Large DNA fragments 

The hybrid cell line HHW105 (Overhauser et al.) was used 
as the source of genomic DNA. This cell line contains all of 
human chromosome 5. The human chromosome is retained by selec- 

20 tion for the leucine aminoacyl tRNA synthetase gene on human 
chromosome 5 which complements a corresponding defective 
hamster gene. This hybrid cell line was employed to obtain an 
immediate enrichment of chromosome 5 sequences. 

The gene encoding IL-5 is located in the 5q31.3 region 

25 of chromosome 5 (LeBeau et al) . Genomic DNA was isolated from 
cell line HHW105 (Maniatis et al, Smith et al) and digested 
with a variety of restriction endonucleases . The resulting 
genomic DNA fragments were size fractionated by Pulse-Field 
Gel Electrophoresis (Carle and Olson) . The DNA fragments in 

30 this gel were then transferee to nitrocellulose (Southern) and 
the nitrocellulose filter hybridized with a random-primed IL-5 
cDNA fragment (Azume et al) . A NotI fragment which hybridized 
with the IL-5 probe was identified. This 500 kilobase (kb) 
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No£I fragment was reduced by about 50 *b upon double digestion 
with Nrul, this Notl/ttrul fragment was chosen as a target 
reg lon for purification and subsequent use in homologous cDNA 
selection. The purification of this fragment is outlined in 
Figure 8 and described below. 

5 X 10 9 cells were harvested and washed once in Dulbec- 
co's Phosphate Buffered Saline (PBS). The cells were resu- 
spended in PBS at 1 x 10° cells/ml to which was added an equal 
volume of L-buffer (50 mM NaCI, 50 mM EDTA, p H =8) and 1% low 



10 



15 



20 



gelling temperature (LGT) agarose (FMC Byproducts) held at 
50 C. Proteinase K was then added to a final concentration of 
0.5 mg/ml. The mixture was gently pipetted into block former 
and the blocks allowed to solidify. These blocks were thin 
Placed in 50 ml conical tubes, totally immersed in 40 mis of 
0 5 M EDTA, P H=9, !% sarkosyl, and 200 ng/ml proteinase K, and 
then incubated overnight at 50°C. The 40 mis of solution was 
then replaced with L-buffer containing PMSF at a final con- 
centration of ImM and incubated for 2 hours at room tempera- 
ture; this wash was repeated. The blocks were then, washed 
twice, with 2 hour incubations for each wash, in TE ' (10 mM 
Trxs HC1, pH =8, 1 mM EDTA) at room temperature. 

B. The First CHEF gel. 

„ , . A 1% LGT agarose ' 0-5 X THE (5 X TBE stock solution, per 
liter: 54 g Tris Base, 27.5 g boric acid, and 20 mis 0.05 M 
EDTA <p H -8)> gel was poured containing 4 conventional wells 
and one long preparative well. Sixteen blocks of HHW105 DN* 
were equilibrated in l x Notl reaction buffer (New England 
Biolabs) for two incubations of one hour each. The equili- 
brated blocks were incubated with Notl - at 37°C, for four 
hours, under the following digestion conditions: 16 blocks in 
15 ml corex tube, 200 ^ 10 X Notl buffer, 20 >* 10 mg/mi 
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acetylated bovine serum albumin (BSA) , 200 NotI enzyme (10 

units per , 1560 jil water. The blocks were then treated 
for 2 hours in 1 mM PMSF . One individual block, lambda DNA 
ladder size markers (C185757, FMC Bioproducts) and Schizosac- 
5 charomyces pombe chromosomal DNA (972, FMC Bioproducts) were 
loaded in the conventional wells. The remaining blocks were 
heated to 65°C, melted and loaded in the molten state into the 
preparative well. The gel was run under standard CHEF gel 
conditions (Smith et al . , Chu et al . ) for 48 hours at 200 V 

10 with 82 second switching intervals. The portion of the gel 
containing the 4 conventional lanes was then cut away from the 
preparative well and stained with ethidium bromide (see Figure 
9-A, First CHEF gel) . Since CHEF gel tracks are straight 
rather than curved, DNA fractions covering size ranges across 

15 "the gel could be collected. The preparative portion of the 
§el was sliced, using a glass cover slip, into. 1-2 mm slices 
covering the size range from approximately 1000 kb to 150 kb 
for a total of 30 slices down the gel. 

20 C. Identification of the IL-5 containing fraction 

Identification of the IL-5 containing fraction was 
achieved by sequence-specific amplification using IL-5 speci- 
fic primers (IL-5-5' = 5 ' -GGGAATTCATGAGGATGCTTCTGCATTTG-3 ' ; 
IL-5-3' = 5' -GGAAGCTTTCAACTTTCTATTATCCACTCGGTGTTCATTAC-3' ) . 

25 DNA was extracted from a 1 mm by 2 mm portion of each slice of 
the gel; the rest of each slice was stored until the fraction 
containing the IL-5-homologous fragment was identified. The 1 
X 2 mm sample was placed in a microfuge tube, heated to 65°C 
until the agarose was molten (approximately 5 minutes) , mixed 

30 gently and a 20 p.1 aliquot was placed on a piece of Parafilm iM 
and allowed to solidify to an agarose bead. The agarose bead 
was placed in 200 |il of 1 X polymerase chain reaction buffer 
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at roo t S Kit, and eguilibrated 

at roo, temperature for 30 minutes. The buffer was then 
removed and the following standard reaction mixture added to 
the agarose bead: !6 „! 1.25 „ ^ „ x polymerase 

chain reaction buffer, 5 ,1 20 1L . 5 . 3 . prlmers , 5 ^ „ 
IL-5-5 , 63. S „1 water. The mixture „ as heated to 65'C until 
the agarose was molten and then 0.S ul of Tag I DNA polymerase 
was added. The mixture was briefly vortexed, . thin ? J ™* 
mineral oil placed on top of the reaction mixture, and the 
10 ampliation reaction run under the following conditions 
denature for 1 minute at 9<TC, anneal for 1 minute at 50-c 
extend for 2 minutes at 72-c, repeat for 25 cycles. Following 
the amplication, a conventional agarose gel was' run tl 

15 LTT" £raCti ° n C ° ntalni ^ IL " 5 «— figure 9, polymer- 

ase chain reaction products); in parallel three control poly- 

zri: t : h :'\ reaction reactions - ere run °- e - h 

* lth IL ' 5 ^rmers, hamster DNA with IL-5 primers, and IL-5 
prxmers alone. An example of a single CHEF gel track of NotI 
digested HHW105 genomic DNA and of the polymerase chain ,re^T 
20 .ion products is shown in Figure 9. As seen in this figure a 
Single DNA sample contained the majority of the IL-5 f ragment , 
this polymerase chain reaction product emigrated with the 
human DNA control polymerase chain reaction product. T J 

25 rlZT Sh °" ed "° Polymerase cha'in 

reaction products. 



30 



D. The Second CHEF Gel 

in order to achieve a further purification of the rele- 

l2\n*T en i- £r ° m cont - 1 «"»9 hamster fragments, the 

DNA in the slice corresponding to the IL-5-positive polymerase 
cha ion product „ as sub . ected te J * £ 

Nrul restriction endonuclease, and size fractionated on a chff 
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gel (as described above for NotI in Section B) . The gel was 
sliced as above and a small aliquot of DNA obtained from each 
slice was subjected to polymerase chain reaction amplification 
with the IL-5-specif ic primers to determine the positive 
5 fractions. The polymerase chain reaction amplification pro- 
ducts were resolved on a conventional agarose gel, the DNA 
molecules transferee to nitrocellulose, and, in this case, the 
IL-5-specif ic fraction was identified by hybridization with a 
radiolabeled IL-5 cDNA probe. The resulting autoradiogram is 
10 shown in Figure 10. 

E. Amplification of the IL-5 positive fraction by sequence- 
independent amp 1 i f i cat i on . 

In order to have sufficient quantities of these genomic 

15 -DNA fragments for use in cDNA library screening and for the 
Construction of a library of these genomic fragments, the 
genomic DNA fragments were amplified using the sequence- in- 
dependent single primer amplification (SISPA) method described 
in Section C. To generate the blunt-ended molecules necessary 

20 for the linker addition, the DNA from the relevant size frac- 
tion which was identified in the second CHEF gel was digested 
with EcoRV, which generates blunt end cuts in the DNA at a 
hexanucleotide recognition site. The digestion products were 
fractionated . on a conventional 0.8% agarose gel with size 

25 markers in parallel. The -gel was rotated through 90° and the 
digested DNA was electroeluted onto a charged membrane filter 
(NA45, Schleicher and Schuell) . The filter was cut into ten 
strips, five corresponding to the positions of DNAs of 2 kb or 
less in size, and five corresponding to DNAs above 2 kb in 

30 size. The DNAs were eluted from the filter in very small 
volumes (elution buffer: 1 M NaCl, 50 mM Arginine. for one hour 
at 65°C) , phenol/chloroform extracted, and precipitated in the 
presence of carrier t-RNA. DNAs above 2 kb in length were 
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subjected to one more set of restriction endonuclease diges- 
tions using Haelll and Rsal; both of these enzymes generate 
blunt ends and recognize four base pair sites. In this way 
the larger DNAs were reduced to lengths that could be., more 
5 efficiently amplified. Each of the ten fractions were ligated 
using T4 DNA ligase (New England Biolabs) to the A linkers 
(Figure 2-A) and were then cut with Nrul to eliminate any 
linker dimers which had been formed in the ligation reaction. 
The volume of the restriction endonuclease digestion reactions 
10 were increased to 100 ul. The fractions were then passed over 
Sephadex™ G-50 (Maniatis et al . , page 4 66). 

The genomic fragments were then amplified, as described 
in Section C and Example 2 using Taq DNA polymerase I (Perkin- 
Elmer Cetus) and the A primer under the following conditions: 
25 jil of the, linkered fraction, 5 ul of 20 uM A primer, 10 ul 
10 X amplification reaction buffer (100 mM Tris HC1, pH=8.3, 
500 mM KC1, 15 mM MgCl 2 , 0.1% (w/v) gelatin), 16 ul 1.25 mM 
dNTPs , 43.5 ul water, 0.5 ul Taq I DNA polymerase. The reac- 
tion mixture was cycled 25 times for the following periods: 
denature at 94°C for 1 minute, anneal at 50°C for 1 minute, and 
extension at 72°C for 2 minutes. An aliquot of each amplified 
fraction was separated on a 1% agarose gel which is shown in 
Figure 11. A smear of amplified products was seen. Carrier 
t-RNA treated in parallel does not amplify. The furthest most 
right hand lane of Figure 11 is the size standard lambda 
Hindi II. The DNA in this gel (the final SISPA gel) was trans- 
ferred to a GeneScreen™ filter (DuPont/New England Nuclear) , 
the DNA cross-linked to the filter by UV, and the filter pre- 
hybridized overnight at 65°C in GeneScreen™ buffer (Church and 
Gilbert) with the addition of 100 ng/ml of sheared herring 
sperm DNA (the final SISPA filter, see Example 7) . 
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An aliquot of the amplified genomic fragments (not bound 
to nitrocellulose) was used for the construction of a fragment 
library . 

5 Example 6 

Preparation of the T-cell cDNA library 
T-cells were separated from whole blood by standard 
methods and the leu3+ fraction specifically selected. The 
leu3+ T-cells were induced with PMA (phorbol myristate ace- 
10 tate) by standard procedures. RNA was extracted from the T- 
cells 6 hours post-induction with PMA, and the poly-A* RNA 
fraction was purified from T-cell total RNA by affinity chro- 
matography on an oligo-dT cellulose column. A leu 3+ T-cell 
cDNA library was prepared essentially as described by Maniatis 
15 el al (pages 229-24 6) . Synthesis of the first strand was 
achieved using an oligo-dT primer and reverse transcriptase. 
Second strand cDNA synthesis was performed by a combination of 
the Klenow fragment of DNA polymerase I, followed by treatment 
with reverse transcriptase. Hairpin' structures were resolved 
20 with Si nuclease. EcoRI linkers were blunt end ligated onto 
the newly synthesized duplex cDNA molecules and these mole- 
cules were then ligated into the lambda gtlO vector for clon- 
ing purposes (Protoclone™ lambda gtlO, Promega) . 

A 20 jig aliquot of the T-cell DNA library DNA was diges- 
25 ted with EcoRI to liberate the cDNA inserts from the phage 
arms (approximately 4% of the total weight is insert cDNA) . 
The EcoRI ends were then converted to blunt ends by filling in 
with dATP and TTP, using the Klenow fragment of DNA polymerase 
I (Maniatis et al., page 113). The use of only dATP and TTP 
30 repairs only the EcoRI sites but not the cos sites of lambda, 
thus avoiding the potential problem of creating blunt end 
termini on both ends of the contaminating phage vector DNA. 
The C linker (Figure 2-C) was then ligated to this_total cDNA 
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library and the ligation mixture digested with Nrul to remove 
dlmfirS - ThS V ° lume ° f the reaction mixture was then increased' 
to 1.0 ml with GeneScreen- buffer, sheared lambda gtlO DNA 
added to a final concentration of 100 ng/ml, and the mixture 
incubated at 65°C for 90 minutes; this mixture is the probe 
used in Example 7. * 

In addition, this preparation of the cDNA library may be 
pre-hybridized to randomly-sheared total human genomic DNA 
before hybridization to the final SISPA filter, to p^e-block 
repetitive sequences present in the cDNA library and thus 
reduce the background/signal noise (see Section ii-b (III) ) 
The cDNA library can be amplified using the C-linker sequenc- 
es an end-terminal primer for amplification (Example; 7) 
Alternatively, before the cDNA molecules are liberated ' fr-om 



10 



15 the lambda gtlO vector, common vector sequences" flanking the 
5' and 3' end of the cDNA inserts can be used as end-terminal 
primers to amplify. the cDNA inserts. 



20 



Example 7 

Hybridization of the T-cell cDNA n brary to the chromc ; some , 

derived genomic DNA 
A. Hybridization and Washing 

The T-cell cDNA library probe constructed in Example 6 
was added to the final SISPA filter prepared in Example 5 in a 
25 total volume of 10 mis of GeneScreen" buffer. The hybridiza- 
tion was carried out for 48 hours at 65'C. The filter was 
then extensively washed under the following conditions: 
1. Two washes with 1 x SSC / 0.1% SDS at room temperature 
for 15 minutes each. 

30 2. two washes with 1 X SSC / 0.1% SDS- at 65°C for 15 min- 
utes each, 

3. Two washes with 0.1 x SSC / 0.1% SDS at 65°C for 15 

minutes each. 
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4. A final wash with 0.1 X SSC / 0.1% SDS at 65°C for 20 
hours . 

B. * Elution of the selected cDNAs from the filter 

5 'After the final wash, the selected cDNAs were eluted 

from the filter by treatment with 5 mis of 50 mM NaOH at room 
temperature. This eluent was neutralized to pH=8 . 0 with 
approximately 2 mis of 1.0 M Tris HC1. The cDNAs were precipi- 
tated with NaCl, yeast RNA, and ethanol, and then resuspended 
10 in 100 fil of water. The resuspended cDNAs were then passed 
over a Sephadex™ G-50 column. 

C. Amplification and Analysis of the selected cDNAs 

T he cDNAs present in the eluent were then amplified 
15 using- the C primer and the following conditions: 30 \xl eluted 
matelial, 10 jil 10 X amplification reaction buffer, 16 jil 1.25 
mM dNTPs, 5 p.1 20 pM C primer, 45 jil water, 0.5 jil Taq I DNA 
polymerase. The amplification reaction was carried out for 25 
cycles as follows: denature at 94°C for 1 minute, anneal at 
20 45°C for 1 minute, and extend at 72°C for 2 minutes. The 
amplification products were run on a conventional agarose gel 
to examine the products (Figure 12) . The DNA in the gel was 
transferee to a nitrocellulose filter and the filter hybri- 
dized with 6 radiolabeled probes: actin, hamster genomic DNA, 
25 human genomic DNA, IL-3 specific (Yang et al . ) , IL-5 specific 
(Azume et al . ) , and human mitochondrial DNA specific (CF14) 
(Figure 13) . 
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Example 8 

Selection of cDN As correspond! nrr t-o -t-v,^ 

>m.ieaponain q to the genomic region 

containing il -3 and GMCSF 
The methods applied to the analysis of IL-5 have also 
been app^d to IL-3. Specific probes for IL-3 (Yang et al ) 
and GM-CSF (Schrader et al, are known. Using an IL-3 specif^ 
Probe the IL-3 containing regibn of chromosome 5 was isolate! 
as described above; a long-range restriction map of this 
region is shown in Figure 15. By prob ing isolated genomic 
clones wxth labelled probes specific for IL-3 and GM-CSF we 
showed a physical linkage between these loci of less than 10 
kxlobases. T-cell cDNAs were then selected on the basis of 
their homology to the selected IL-3 containing region genomic 
DNA reg.cn. Figure 14 shows the hybridization of these selec- 
ted cDNAs with probes specific for IL-3, GM-CSF, and actin, 
and also a total human genomic probe. Specific hybridization 
to IL-3 and GM-CSF was clearly visible. The lack of hybridiza- 
aCt±n UnderSC ° res that the hybridization to IL-3 and 
GM-CSF was specific and the lack of hybridization to total 
numan genomic DNA indicates the absence of large numbers of 
repetitive sequence DNA in the selected cDNAs . 



25 



30 




A. Preparation of the Genomic Bovine Leukosis Virus DNA 

The cloning and sequencing of the bovine leukosis virus 
(BLV) genome have been previously described (Kashmiri et al 
Sagata et al.). Full length raolecular DNA clones Qf ^ - 

responding to the RNA genome of the virus were obtained from 
J.F. Ferrer, University of Pennsylvania, School of Veterinary 
Science. 100 nanograms of BLV DNA was heat denatured to yield 
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single-strands and spotted to a 0.45 micron nitrocellulose 
filter (Schleicher and Schuell) . The filter was then baked 
for 2 hours at 75° under vacuum to fix the DNA to the filter 

(Ricciardi et al . ) . The following solution was used for pre- 
5 hybridization of the filter: 50 mM Tris HC1, pH=7.5; 1 mM 
EDTA; 0,5% SDS; 3 X SSC (20 X SSC stock solution = 175.3 g 
NaCl, 88.2 g sodium citrate, adjusted to pH=7, having a total 
volume of 1 liter of water) ; 1 X FPB (10 X FPB stock solution 
= 0.2% Ficoll, 0.2% polyvinylpyrrolidone, and 0.2% bovine 
10 serum albumin) ; 50% Formamide; and, 100 jig/ml denatured salmon 

sperm DNA. Pre-hybridization was carried out for 20 hours at 

37°C. 

B> Preparation of the Human Lymphoma-associated Virus cDNA 

15 v from ceil line 10C9 

v. 

-I - 10C9 virus particles were purified from 10C9 cell cul- 
ture supernatants which were clarified of cellular debris by 
centrifugation (10,000 X g, 15 minutes at 4°C) , concentrated 
100-fold by Amicon™ ultrafiltration, and pelleted through a 

20 25% sucrose cushion/PBS (100, 000 X g, 4°C for 90 minutes). 
The resultant pellet was resuspended in Dulbecco's Phosphate 
Buffered Saline (PBS) , homogenized, and overlaid on a 15-60% 
sucrose density gradient, centrifuged for 16 hours at 100,000 
X g, 4°C; the virus band at 1.15-1.18 g/ml was collected, 

25 diluted with PBS and pelleted by ultracentrif ugation to remove 
the sucrose (100,000 X g, 4°C for 90 minutes). 

The viral RNA was extracted from the particles by the 
method of Chomzynski et al . and was then used as template in 
random-primed cDNA synthesis (kit obtained from Boerhinger- 

30 Mannheim) . The cDNA molecules were biunt-ended using the 
Klenow fragment of DNA polymerase I and all four deoxyribo- 
nucleotides (Maniatis et al . , page 113). The A linkers (see 
Figure 2-A) were then ligated to the ends of the_cDNA mole- 
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cules using T, DNA U,ass ,Ne„ England Biolabs, . These cDNA 
molecules were then digested with Nrul to eliminate any li„ ke - 
drmers which had been formed in the ligation reaction 
The IOCS, cdna library was then amplified using the sequence- 
mdependent single primer amplif i cat i on m e th od described in 
Examples 2 and 6. n 

C Hybridization of the 10C9 cDNA library to the BLV genome 
The pre-hybridized filter described above was then 
hybrxdxzed for 36 hours at 37°C with the 10C9 cDNA library 
under the following conditions: 50 mM Tris HC1, P H=7.5; 1 mM 

EDTA; 0.5% SDS; 3 X SSC (20 X ^ ^ , • 

" L ^ u x ssc stock solution = 175 3 g 

NaCl, 88.2 g sodium citrate, adjusted to P H=7, having a total 
volume of 1 liter of water); 1 X FPB (10 X FPB stock solution 
= 0.2. Ficoll, 0.2% Polyvinylpyrrolidone, and 0.2% bovine 
serum albumin); 50% Formamide; 10% Dextran Sulfate; and, 100 
ng/ml of the 10C9 amplified cDNA library. 

After the hybridization period the filter was transferee 
20 ;e°rf a or C me e d a : ' "* ^ ^ ^ »^ 

1. 100 ^1 of 2 X SSC and 0.5% SDS , at room temperature for 

fifteen minutes. 
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2 . Repeat step 1 . 

„ 3 ' 100 ^ 2 X SSC and S °S, at 60°C for 30 minutes. 

« -4. Repeat step 3. 

5. 100 m 0.5 X SSC and 0.1% SDS at 60°C for 30 minutes. 

6. Repeat step 5. 

7. Elution of the bound cDNAs was achieved by washing with 

100 ^ ° f °' 1 X SSC a "d 0.1% SDS at 95-100°C for 15 

JU minutes. 

8 . Repeat step 7 . 
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The eluents were run on a conventional agarose gel, 
* transferred to nitrocellulose and hybridized with an IL-5 spe- 
cific probe (Campbell et al; Azume et al) ; elution of IL-5 
specific sequences was seen only in the eluerit of steps 7 and 
5 8 . 

The eluents were amplified as described in Example 7C 
except using the A primer. 

Example 10 

10 Characterization of 10C9 Virus cDNAs homologous to the BLV 

genome 

The amplified selected 10C9 cDNAs present in the eluents 
of Example 9 were then cloned to create two libraries: one in 
lambda gtIO (Protoclone™ lambda gtlO system, Promega) and one 

15 in lambda gtll (Protoclone™ lambda gtll system, Promega) . The 
linkers used in the construction of the amplified cDNAs also 
contained an EcoRI site (see Figure 2) which allowed for 
direct insertion of the amplified cDNAs into both lambda gtlO 
and gtll vectors. 

20 The fusion proteins expressed by the lambda gtll clones 

were screened with antibody probes (Young et al . ) . Screening 
with bovine anti-BLV (a-BLV) detected hundreds of positive 
clones from 4 X 10 4 plaques screened. Screening with a-p24 
detected 15 positive plaques out of 10* screened, these posi- 

25 tives were plaque purified and cross-screened with a- BLV , 4/15 
of these clones were positive with both anti-sera. Screening 
with a-gp62 detected 4 positive clones which were all also 
positive when cross-screened with a-BLV, 

The lambda gtlO cDNA library was screened with a full 

30 length molecular clone of BLV. Plaque hybridization screening 
(Woo) of lambda gtlO clones containing approximately 600 base 
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pair inserts yielded ,6 positives out of 2 X 10" places 

screened, 4 ■ 

While preferred embodiments, uses, and methods of prac- 
thS PreSent i»*-ntlon have been described in detail, it 
WH1 be appreciated that various other uses, and methods of 
practice as indicated herein are within the contemplation of 

the present invention. 
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WHAT IS CLAIMED IS; 

1. A method of cloning cDNA species which are homolo- 
gous to a region of contiguous genomic DNA and are selected 

5 from a mixture of cDNA species, said method comprising, 

(a) obtaining the region of- interest of contiguous 
genomic DNA, 

(b) preparing cDNA species to contain end-terminal 
priming sequences, 

10 (c) isolating single-stranded cDNA species on the basis 

of their hybridization to the region of contiguous genomic 
DNA, 

(d) mixing the isolated single-stranded cDNA species 
with DNA polymerase, all four deoxyribonucleotide triphos- 

15 phates, and primers homologous to the cDNA end-terminal prim- 
ing sequences, 

(e) reacting the mixture under conditions to produce 
sequence-independent amplification of the single-stranded cDNA 
species, and 

20 (f) cloning the amplified cDNA species into a vector. 

2. The method of claim 1, wherein the region of con- 
tiguous DNA contains a region of interest, and said obtaining 
includes (a) size fractionating a DNA section Containing the 

25 region of interest, (b) identifying the fraction containing 
the region, (c) isolating the DNA from the identified frac- 
tion, and (d) digesting the isolated DNA with a restriction 
endonuclease, to cut the region of interest into a plurality 
of smaller fragments . 

30 
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, h ^ meth ° d ° f Claim 2 ' herein the DNA containing 

the region of interest is identified in the fraction by use of 
primer-initiated amplif ication in the presence Qf 
which are homologous region. 

4. The method of claim 1, wherein the contiguous geno- 
mic DNA section is bound to a solid support. 
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The method of claim 1, wherein the contiguous geno- 
mic DNA section is contained in yeast artificial chromosomes. 

6. The method of claim 1, wherein the contiguous geno- 
mic DNA section represents the bovine leukosis virus genome 

15 H t miXtUre ° f CDNA SPSCieS 15 3 CDNA lib *« Y ^om 
15 mRNA obtained from a cell line infected with a ^ 

from cell line 10C9. 



which ° f ° laim ^ ±n ^"fying cDNAs 

wnich correspond to genes located in the same chromosomal 
region as a known gene, wherein fragments containing said 
contiguous genomic DNA region are obtained by, 

(a) preparative size fractionating ' of genomic DNA 
fragments containing the known gene, 

(b) ligating linkers to the genomic DNA fragments, 
containing the known gene, which are useful as primers for 
sequence-independent amplification, 

(O mixing the DNA fragments with DNA polymerase, all 
.our aeoxyribonucleotides, and prime rs homologous to the 
linkers present on the ends of the DNA fragments, and ■ 

(d) reacting the mixture under conditions to produce 
sequence-independent amplification of the DNA fragments 
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8. The method of claim 7, wherein the preparative size 
fractionating of genomic DNA fragments containing the known 
gene involves identification of the DNA fragments of interest 
by adding primers homologous to the known gene, DNA polymer- 
5 ase, and all four deoxyribonucleotides, to the gel matrix and 
treating the gel matrix under conditions which promote amplif- 
ication of the region of the known gene defined by the pri- 
mers . 

10 9. The method of claim 1, wherein the known gene is a 

growth factor or growth factor receptor gene. 

10. The method of claim 9, wherein the known gene is an 
interleukin . 

15 " 

11. The method of claim 10, wherein the known gene is 
IL-5 and the mixture of cDNA species is a T-cell cDNA library. 

12. The method of claim 9, wherein the known gene is 
20 erythropoietin and the mixture of cDNA species is a renal cDNA 

library . 

13. The method of claim 1, wherein said preparing of 
cDNA species to contain end-terminal priming sequences further 

25 includes the amplification of the cDNA species by mixing the 
single-stranded cDNA species with DNA polymerase, all four 
deoxyribonucleotide triphosphates, and primers homologous to 
the cDNA end-terminal priming sequences , and reacting the 
mixture under conditions to produce sequence-independent 

30 amplification of the single-stranded cDNA species. 
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14. The method of claim 1, wherein said preparing of 
cDNA species to contain end-terminal priming sequences in- 
cludes the cloning of the cDNA species into a suitable vector 
and using the known 5' and 3' vector sequences, which flank 
the cDNA insert, as end-terminal priming sequences. 



5 



15. The method of claim 14, wherein said preparing 
further includes the amplification of the cDNA species by 
mixing the single-stranded cDNA species with DNA polymerase, 
10 all four deoxyribonucleotide triphosphates, and primers homol^ 
ogous to the cDNA end-terminal priming sequences, and reacting 
the mixture under conditions to produce sequence -independent 
amplification of the single-stranded cDNA species. 

15 16. A method of cloning cDNA species which are homolo- 

gous to a region of contiguous genomic DNA and are selected 
from a mixture of cDNA species, said method comprising, 

(a) preparing cDNA species to contain end-terminal 
priming sequences, 

20 (b) isolating single-stranded cDNA species on the basis 

of their hybridization to a first set of selected genomic 
fragments, 

(c) mixing the isolated single-stranded cDNA species 
with DNA polymerase, all four deoxyribonucleotide triphos- 
phates, and primers homologous to the cDNA end-terminal prim- 
ing sequences, 

(d) reacting the mixture under conditions to produce 
sequence-independent amplification of the single-stranded cDNA 
species, 

30 (e) isolating single-stranded cDNA species, from the 

mixture of step (d) , on the basis of their hybridization to a 
second set of selected genomic fragments, 



25 
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(f) mixing the isolated single-stranded cDNA species of 
step (e) with DNA polymerase, all four deoxyribonucleotide 
triphosphates, and primers homologous to the cDNA end-terminal 
priming sequences, 
5 . (g) reacting the mixture under conditions to produce 

sequence-independent amplification of the single-stranded cDNA 
species, 

(h) cloning the amplified cDNA species into a vector. 
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